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BLAHK 


INTRODUCTION 


The  Meeting  on  Solute/Solvent  Interactions  represents  the  first  such 
endeavor  at  the  U.S.  Army  Chenucal  Research,  Development  and  En^eering 
Center  (CRDEC).  All  told,  there  were  16  presenters  from  CRDEC,  Naval 
Research  Laboratory,  Environmental  Protection  Agency,  and  academic 
institutions.  In  addition,  four  international  scientists  were  in  attendance. 

This  meeting  was  modeled  around  the  'Laguna  Beach'  meetings  organized 
by  Mortimer  J.  Kamlet  in  the  mid  to  late  IdSOs.  In  those  meetings,  as  well  as 
these,  there  formal  presentations  by  each  of  the  invited  attendees.  In  addition, 
there  were  numerous  periods  for  informal  discussions.  This  report  contains 
the  written  version  of  most  of  the  presentations. 

Over  the  two  day  period,  the  presentations  were  divided  into  4  basic 
session  areas:  Equilibrium,  Theoretical,  New  Approaches,  and  Polymers. 
Many  topic  areas  applicable  to  solute/solvent  interactions  were  covered.  Most 
prevelant  were  applications  invloving  the  generalized  linear  solvation  energy 
relationship  (LSER),  and  new  methods  to  measure  the  necessary  parameters. 
In  addition,  theoretical  approaches  were  presented  that  apply  both  ah  initio 
and  semi-empirical  quantum  chemical  methodolo^es  to  the  LSER.  Two  new 
computer  programs  were  also  presented  dealing  with  the  prediction  of 
solute/solvent  properties.  The  furst,  QSAR,  is  a  compilation  and  thesaurus  of 
quantitative  structiue  activity  relationships  (QSAR)  compiled  over  the  years. 
The  second,  SPARC,  is  an  attempt  to  provide  relevant,  accurate  predictions  of 
properties  for  EPA  related  projects. 

Finally,  I  would  like  to  thank  ail  of  the  attendees  to  donating  their  time  to 
share  their  research.  Without  their  contributions,  this  meetii^  would  not  have 
been  possible. 


George  R.  Famini 
Meeting  Chairman 
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Meeting  on  Solute/Solvent  Interactions 
Wednesday,  29  May,  1991 

Inherent  Structural  and  Solvent  Effects  on  Proton  Transfer  and  Hydrogen- 

Bond  Addities 

R.W.  Taft,  Department  of  Chemistry,  University  of  California,  Irvine 

Recent  ICR  studies  of  gas  phase  addities  in  our  laboratory  have  been 
extended  to  indude  (neutral  organic)  proton  transfer  adds  of  very  great 
addity.  In  particular,  the  trifilate  substituent,  SO2CF3,  and  its  derivatives 
have  been  investigated.  The  gas  phase  proton  transfer  addities  can  be 

compared  to  those  in  dimethyl  sulfoxide.  The  introductory  overhead 

AG** 

illustrates  the  great  differences  that  are  found  between  pK^  =  (HA 

+  A*;  AG®)  in  the  gas,  dimethyl-sulfoxide  (hereafter  given  by  s  or  Me2SO) 
and  aqueous  phases  for  familiar  adds.  The  very  large  differences  of  pK® 
valu^  in  solution  phases  compared  to  gas  phase  result  from  the  reaction  of 
H'*’  with  solvent  and  the  ion  solvation.  However,  very  significant  ApKa 
differences  for  pair  wise  comparisons  of  adds  (the  corresponding  proton 
transfer  equilibria)  are  also  to  be  noted  between  the  three  different  phases. 
These  ApKa  differences  in  solution  compared  to  the  corresponding  gas  phase 
ApKa  reveal  the  differentials  between  the  free  energies  of  solvation  of  anions 
and  neutral  adds.  For  different  families  of  adds  the  ApKa  values  frequently 
are  widely  different,  whereas,  at  least  qualitatively,  it  is  less  common  to  find 
different  addity  orders  within  a  given  family. 

These  generalities  are  illustrated  in  the  second  overhead  for  several 
series  of  trifflated  and  related  adds  by  the  gas  and  dimethylsiilfoxide  phase 
pKa  values.  Also  illustrated  is  the  fact  that  successive  substitutions  into 
methane  can  product  much  stronger  acids  than  the  corresponding  single 
substitution  in  the  water  molecule.  For  example,  (CF3S02)3CH  is  4.6  powers 
of  ten  stronger  than  trifilic  add  (CF3SO2OH)  in  the  gas  phase.  An  interesting 
inversion  from  the  intrinsic  gas  phase  order  is  shown  for  the  addities  of 
(CF3S02)2CH2  (the  stronger)  and  (CF3S02)2NH  (the  weaker)  in  Me2SO.  A 
probable  explanation  is  given  in  the  third  overhead.  Such  inversions  are 
common  for  general  comparisons  of  pKa(gas)  vs  pKa(s).  In  order  to  help 
single  out  the  differentials  in  the  solvation  energies  of  a  given  HA /A”  pair, 
Taft  and  Bordweil  (reference  given  in  the  first  overhead)  have  used  the 
proton  transfer  equilibria  shown  in  overhead  4,  in  which  9-C6H5  fluorene 
and  its  conjugate  anion  represent  a  series  of  very  poorly  non-specifically 
solvated  add/base  pairs.  These  reference  unsaturated  hydrocarbons  have  no 
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heteroatoms  and  have  highly  charged  deloasdized  carbanions^  a  structural 
feature  that  leads  to  no  ^fect  of  MeaSO  on  their  relative  additira  (relative 
addities  are  the  same  far  gas  and  Me2SO  phase).  Plots  of  ApKaCg)  (or  MG^’Cg}) 
vs  ApKa(s)  (or  AAG°(s))  for  the  9-Ph  fluorene  reference  then  reveal  HA/A' 
specific  solvation  effecte  as  deviations  from  the  hiic  of  unit  slope  (overhead 
4)).  When  A*  solvation  by  dipolar  (but  non-HBD)  Me2SO  exceeds  dipolar  and 
AH“OSMe2  H“bonding  solvation  of  iiS  conjugate  add,  large  deviations  to  the 
right  of  the  line  of  unit  slope  are  observed  (as  for  H2O  or  CH3OH,  for 
example).  The  pointe  for  adds  that  lie  to  the  left  of  the  line  of  unit  slope  (eg 
CF3SO2OH),  involve  relatively  strong  H-bonding  solvation  of  HA  and 
decreased  anionic  stabilization  by  dipolar  substituents  that  occurs  in  dipolar 
solvents  as  compared  to  the  gas  phase. 

The  equilibrium  in  a  gas  phase  proton  transfer  reaction  is  determined 
only  by  intrinsic  structural  effects  (there  are  no  counter  ion  or  solvation 
effects).  The  corresponding  solution  phase  reaction  has  superimposed  upon 
the  main  proton  transfer  equilibrium  a  set  of  solvation  equilibria.  Hydrogen* 
bonding  equilibria  of  the  t5q>e  AH  +  OSMe2  *1  AH-  OSMe2  are  important 
contributors  to  pKa(s)  values  and  to  the  direction  and  magnitude  of  the  kinds 
of  deviations  illustrated  in  overhead  4.  This  follows  from  the  fact  that 
structural  dfects  on  proton  transfer  addity  and  corresponding  hydrogen-bond 
donor  addities  in  general  are  simply  unrelated.  This  is  shown  in  the  highly 
disordered  plot  of  overhead  5  which  compares  currently  available  values 

AG®ha  *■ 

with  corresponding  available  gas  phase  proton  transfer  - - 

values.  Even  within  families  of  adds,  deviations  from  a  linear  relation 

AG®ha  ■’  AG®hi  h 
between  a  corresponding  series  of  - 13^ -  ®  2' 


common.  Clearly  the  structural  features  that  particularly  stabilize  anions  are 
nearly  totally  missing  for  corresponding  AH  -B  hydrogen-bonded  complexes. 
We  must  exdude  hydrogen-bonded  ion  pairs  from  this  condusion  since  their 
structural  and  solvent  stabilizations  are  affected  in  a  totally  different  manner 
than  for  the  corresponding  HB  complexes. 

One  of  the  imjKirtant  kinds  of  anion  stabilization  that  occurs  in  both  HB- 
D  and  in  certain  non-HB-D  solvents  (such  as  Me2SO)  has  been  called 
substituent  solvation  assisted  resonance  (SSAR)  effects.  Overhead  6 
illustrates  these  effects  for  the  proton  transfer  addities  of  a  series  of  m-  and  p- 
substituted  phenols  relative  to  phenol.  Pi  electron  delocalization  from  the 
anionic  center  (here  phenoxide)  to  certain  (SSAR)  para  pi  electron  acceptor 

substituents  create  new  solvation  sites  in  these  "ambident"  anions. 
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Substituent  site  solvation  by  this  electronic  mechanism  requires  significant 
localization  of  pi  electronic  charge  at  a  solvent  accessible  electronegative  atom 
of  the  substituent.  Sharing  of  the  delocalized  charge  among  many  such  atoms 
reduces  the  substituent  solvation  to  the  point  that  it  may  be  negated  by  the 
entropy  loss  contribution  to  solvation  free  energies.  Accordingly  the  SSAR 
substituents  in  deaeasing  order  of  importance  are;  p-NO,  p-N02/  p-SOaCFa,  p- 
CH3CO  P-CO2CH3,  CN.  Subsdtuents  such  ns  p-CFa,  p-SCFs,  p-SCeHg,  p- 
SOCH3,  P-SO2CH3  and  all  meta  substituents  have  been  recognized  as  non- 
SSAR. 

Very  recently  completed  work  with  Yagupol'skii's  superacceptor 
substituent  in  the  p-position  of  aniline  is  shown  in  overhead  7.  Koppel's 
group  has  determined  the  corresponding  pKa(s)  for  this  highly  unusual 
aniline.  In  a  plot  of  ApKa(g)  vs  ApKaCs)  for  m-  and  p-substituted  aruiines  (not 
shown)  SSAR  effects  are  significantly  larger  than  those  shown  for  the 
corresponding  phenols  in  overhead  6.  The  superacceptor  substituent 
S0(CF3)NS02CF3  has  a  very  large  op  value  (1.09)  and  a  quite  significant 
value  (.31).  view  of  the  number  of  resonance  forms  that  may  be  written  in 
which  pi  electronic  charge  is  delocalized  from  NH"  of  the  anilide  ion  to 
electronegative  atoms  of  this  substituent,  it  is  very  reasonable  to  expect  and  it 
is  found  in  the  ApKa(g)  vs  ApKa(s)  plot  that  this  para  superacceptor  gives  the 
same  magnitude  of  SSAR  effect  as  does  the  para  SO2CF3.  For  the  trifilate 
substituent  the  op  ^d  values  (.84  and  .21,  resp.)  are  both  smaller. 

The  structures  that  lead  to  gas  phase  superacidities  (that  is,  AGacid  values 
of  less  than  300  kcal/ mole  or  pKa  less  than  220)  have  many  potentially  useful 
properties.  One  example  comes  from  the  work  of  Profs.  D.D.  Des  Marteau  and 
E.  Yeager.  The  substructure  shown  in  overhead  8  has  been  incorporated  into 
polymers  and  these  are  found  to  have  unusually  good  electrical 
conductivities. 
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2  ,  rue  de  la  Houssiniyre 

44072  NANTES  CEDEX  03 

FRANCE 


Hydrogen  bonding  basidty  is  ot  the  major  intrinsic  properties  ot  a  solvent  > 
together  with  its  polarity  and  its  hydrogen  bonding  acidity  .  Corresponding  ^  »lt*’and 
scales  have  been  carefully  elaborated^^^  and  the  multilinear  solvatochromic  equation  : 

m  »  OCYBq  +  iTf  *+  a0(  +  bg 

has  proved  useful  in  rationalizing  solvent  effects  in  many  physico-chemical  properties  and 
in  organic  as  well  as  bio-organic  reactivities  .  However  ,  with  practice  >  the  Linear 
Solvation  Energy  Relationships  have  brought  to  light  some  limitations  of  the  available 
data  in  the  field  of  hydrogen  bonding  and  have  necessitated  statistical  co-additions  of 
various  measurements  involving  several  probes  >  different  temperatures  and  many 
solvents. 

Since  1981  >  we  have  been  interested  in  overcoming  one  aspect  of  these  limitations  : 
the  lack  of  a  general  H,B.  basidty  scale  of  solute  molecules  .  Two  different  thermodynamic 
(  pKhs  )  and  spectroscopic  (A9(OH)  )  HJB.  scales  of  solute  bases  have  been  simutaneously 
re-examined  and  developed  by  means  of  FTIR  spectroscopy  .  The  definition  of  these  scales 
is  outlined  as  -■ 


p-fluorophenol  +  Bast  Complex  ( CCI4  25*0 

K  «  (Complex)  /  (p-fluorophenoU  (Base) 
pKrb  » log  K 

MeOH  +  Base  MtOH*"Base  ( CCI4  20*C) 

A^(0H)»  i?(OH)  -  yfOK-) 
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THE  SPECTROSCOPIC  SCALE 


M*ny  iirthors  have  already  suggested  that  the  -frequency  shift  of  a  reference  RXH  (fig 
3)  could  provide  a  convenient  evaluation  of  the  H.B.  strength  .  However  i  the  selection  of 
the  probe  is  of  primary  importance  because  : 

-  the  relationships  between  the  different  adds  may  be  family  dependent  (fig  4)  . 
Therefore  >  special  care  must  be  exerdsed  when  averaging  data  from  several  probes  . 

-  phenols  must  not  be  used  as  reference  adds  »  since  it  is  impossible  to  get  the 
maximum  of  their  complexed  absorption  with  all  amines  and  with  most  pyridines  (fig  5) . 

We  have  selected  methanol  which  gives  accurate  shifts  from  20  cm"^  for  halogen  and  '1^' 
bases  to  480  cm'^  for  the  strongest  nitrogen  bases . 

The  (OHl  scale  is  found  to  be  very  sensitive  to  the  covalent  part  of  the  hydrogen 
bonding  interaction  (fig  6)  and  practically  insensitive  to  steric  effects  as  shown  in  fig  7 
where  ortho  *  meta  ,  and  para  substituted  pyridines  fall  on  the  same  line  and  where  the 
only  significant  deviations  are  due  to  water  solvation  on  some  specific  substituents  . 
Another  interesting  feature  of  this  scale  is  its  ability  to  detect  the  original  behavior  of 
some  oxygen  acceptors  which  may  present  two  stereoisomeric  complexes  (fig  8) . 

THE  THERMODYNAMIC  SCALE 

Our  purpose  was  to  extend  the  scale  defined  by  Taft  and  co-workers  in  19i9  (fig  9) .  We 
have  now  examined  the  characteristics  of  more  than  400  bases  ►  the  ranges 

(fig  10)  from  -0^5  to  •*■£<£ . 

The  structural  effects  on  this  thermodynamic  scale  may  be  analysed  in  terms  of  the 
nature  of  the  accepting  atom  and  of  the  substituents  linked  to  this  group  .  A  strong 
basidty  requires  two  characteristics  of  the  molecule  : 

-  the  accepting  atom  must  have  a  high  p  character  in  order  to  favor  the  charge 
transfer  interaction , 

-  the  molecule  must  have  polar  bonds  in  order  to  get  a  strong  electrostatic 
stabilisation  . 

The  predominance  of  the  latter  effect  explains  the  order  of  basicity  presented  in  fig  1 1 
for  nitrogen  acceptors . 

Inside  the  different  families  of  bases  *  the  substituent  effects  -  steric  ,  resonance  , 
field/inductivf  ,  proximity  ,  cyclisation  ,  alKyl  effects  -  play  their  usual  roles  (fig  12  -  fig 
15)  .  The  optimum  blend  of  some  of  these  effects  is  found  in  a  cyclic  amidine  (fig  15)  which 
is  the  strongest  molecjlar  base  known  . 
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THE  pX{{s  vs  A^<0H}  plot 

Uhtntvtr  iht  ruturt  of  tht  accipting  stom  or  its  sUtt  of  hybridization  is  changed  a 
new  family  is  defined  provided  that  no  steric  effect  occurs  from  the  substituents  (fig  16) . 
Very  important  new  families  art  presented  such  as  *■ 

-  monomeric  water  and  alcohols  (fig  17) » 

-  nitro  and  nitroso  derivatives  (fig  18)  including  some  solvatochromic  indicators  and 
siqjer  basic  nitro  compounds  • 

-  aromatic  >  ethylenic  and  acetylenic  families  (fig  1?) » 

-  halogen  bases  (fig  20) » 

-  anions  (figs  21-’22> :  in  opposition  to  neutral  bases  these  anions  form  a  homogeneous 
family  whatever  the  nature  of  the  accepting  group . 

For  a  selected  set  of  22  basts  i  the  (OK)  plot  and  the  graph  between  F2  and 

FI  vectors  as  tatcen  from  the  study  of  (jal  and  co-^orKers  are  found  to  be  equivalent  (figs 
23  and  24) . 

THE  POLYFUM(rnOMAUTY 

Most  molecules  of  biological  interest  art  potentially  polyfunctional  toward  HJB.  donors 
because  they  bear  many  insaturatians  or  heteroatoms  (tig  25)  •  The  polyfunctionality  may 
be  undentcored  on  the  7  (OH)  spectrum  of  the  add  (fig  26)  and  in  some  favorable  cases  in 
the  spectrum  of  the  base  (fig  27)  .  When  the  constant  is  obtained  from  the  intensity  of  the 
fret  absorption  of  the  add  «  it  is  the  sum  of  the  microscopic  constants  related  to  the 
different  sites  of  the  molecule  .  These  microscopic  constants  may  be  obtained  with  a  good 
accuracy  from  our  pK^g/  A7  (OH)  plot  and  their  sums  are  in  good  agreement  with  the 
experimental  ones  (figs  28  and  28)  .  With  the  help  of  this  diagram  ,  we  have  been  able  to 
draw  general  rules  concerning  the  polyfunctionality  : 

-  the  first  rule  is  that  the  proton  sharing  process  is  less  selective  than  the  proton 
transfer  .  All  insaturations  and  heteroatoms  usually  behave  as  acceptor  sites  (fig  30) . 

-  the  second  rule  concerns  push-puU  molecules  .  When  direct  conjugation  occurs  between 
two  functional  groups ,  the  electron  donating  one  is  inactive  toward  H3.  donors  i  whatever 
the  nature  of  the  transmitting  group  (fig  30)  .  These  rules  may  be  applied  on  many 
molecules  of  biological  interest  (fig  31) . 


15 


1 


HYDROGEN  BONDING 
BASICITY 
OF 

SOLUTE  MOLECULES 
IN 

TETRACHLOROMETHANE 


F,TLR,  SPECTROSCOPY 


SOLVENT  :  CCU 


FREQUENCY  SHIFT  OF  A 
REFERENCE  ACiD  RX-H 

Av(XH)=  v(XH)-v(XH.,.) 

SPECTROSCOPIC  SCALE 


EQUIUBRIUM  CONSTANT  OF  THE 
H.B,  COMPLEX  FORMATION 


(Complex) 


(Acid)(Base) 


THERiiODYNAMIC  SCALE 


2 


16 


sbtorbaiwa 


Rgur.  16.  Spectr*  IR  du  p-fluoroph6nol  (O.OMS  M}  m  pr<s«m:*  doxyiit  ifo  pro 
pylina  (0.1360  Ml. 


AH-'-B 


AH  +  B 

(AH...B) 

(AH)(B) 


CCI+ 


298  K 


pKiffl  =  log  K  (p-fluorophenol) 


Z7az  36S0  3608  3550  3588  3458  3408  3358  3300 


VP.VENUH8ER  CH-1 


20 


p  -«.85 


STERIC  EFFECT 


FIELD/  INDUCTIVE  EFFECT 


AROMATIC  -  ETHYLENIC  -  ACETYLENIC 


mm 

hSU 

■ 

lommiiiissHi 

HH^IIIli 

HI 

HHHHi 

■■IB 

gesiM 

HALOGEN  BASES 


—  1.0  irri  iTTi'i  iTi  I  n  rrr\  |  ri  r;  n  n-T  )Tri  i  iTi  i  r  i  1 1 1 1 1  m  1 1 1 1  m  1 1 1 1  iri 

0  10  20  30  40  50  60 

Av(OH) 


ANiONS 


F2  vs  Ft 


a  K,  ♦  K,  «  7^  tjnol"' 


BLANK 


32 


Two  main  topics  ware  discussed: 


I }  Tha  comparison  between  ( 1 )  Principal  Component  Analysis 

(PCA)^^’  and  (ii)  Empirical  Methods‘^’  for  the  study  of 
m»llua  effects  on  solute  propeartles. 

II)  The  influence  of  solvent  "polarizability”  on  the  properties 
of  solutes . 


I)  Consider  a  property  P  of  a  solute  in  a  series  of  solvents  (Sg, 
Si, . }.  Within  the  PCA  methodology,  P,  can  be  expressed  as: 

P,  .  Pg  +  S  Ai  (1) 

Empirically,  one  has: 

-  V  S  Cj  (2) 

Pg  and  P'g  are  the  values  taken  by  P  in  reference  solvents. 

The  fundamental  mathematical  difference  between  these  two 
methods  lies  in  the  fact  that  the  vectors  6i  are  mutually 
orthogonal  while,  in  general,  the  vectors  Pj.  are  not. 

In  order  to  compare  both  methods  it  was  decided  to  study 
medium  effects  on  the  mot,  IR  and  UV-Vlsible  spectra  of  a  variety 
of  a  variety  of  solutes,  using  the  following  "rules  of  the  game" : 

a)  Given  the  variety  of  empirical  scales  of  "solvent  effects" 
currently  available,  the  Kamlet>Taft  descriptors  of  dipolarity* 
polarizability  (n*),  hydrogen- bending  acidity  (a)  and  hydrogen- 
bonding  basicity  (S)  were  chosen  as  the  empirical  f^s. 
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b)  The  trealanent  would  ba  "naive",  that  is,  in  all  cases,  the  a* 
parameters  would  be  used  at  face- value,  no  use  being  made  of  the 
dd  correction  term,  generally  appropriate  for  aromatic  and 
polyhalogenated  solvents*  We  knew  that  this  would  generally  lower 
the  quality  of  the  correlations  through  aquation  (2)  but  it  is 
a  fact  that  this  is  what  many  "non-specialist"  users  do. 

c)  The  solvents  used  in  this  study  were  selected  according  to  the 
following  criteria: 

-  They  had  to  span  the  widest  possible  range  of  n*,  a  and  B 
values.  The  only  limitations  being  dictated  by  solubility 
and/or  spectral  interferences. 

-  For  any  given  number  of  solvents  their  information  contents 
should  be  aasimal. 

The  corrsponding  sets  {«*},  {a}  and  {B}  were  to  be  mutually 
otthQQonal  (in  practice,  r^  <  0.06). 

A  set  of  17  solvents,  meeting  these  requirements  was 
"constructed  by  means  of  the  NEMROD  package^’’.  They  are  listed  in 
Table  1. 

We  notice  that  spectral  properties  were  chosen,  among  other 
reasons,  because  they  avoid  the  complications  Inherent  to  cavity 
effects. 


Also  for  comparison  purposes,  we  have  used  Relchardt's  Er*'*’ 
and  Swain's  "acity"  (A)  and  "basity" 

The  spectral  properties  studied  were  as  follows: 

1}  NMR  Spectra 

”C  Chemical  Shifts  of;  Methyl  phenylsulf oxide  (A),  Tropone 
(B),  Pyridine  (C),  4-Methylpyridine  (0),  4- 

Oimethylaminopyrldine  (E),  4-Acetylpyridine  (F),  4- 

Cyanopyrldine  (G),  Chlorobenzene  (H),  a,a,a- 
Trifluoromethylbenzone  (i),  Tropolone  (J),  DMSO  (K). 
Carbon  atoms  in  these  compounds  are  numbered  in  Chart  X. 

34 


All  chenlcal  shifts  in  a  solvent  S  are  refexrred  to  that  of 
the  msta  carbon  of  (I)  and  are  defined  through  equation 
(3). 

<8  -  (6CJs  (3) 

#  Cn 


Chcwical  shifts.  Two  cases  were  exanlnad: 

Tr iphenylphosphlne  oxide  (L)/ref erred  to 
trlphenylphosphite . 

Trlnethyl  phosphate  (M)/referred  to  triisopropyl  phophite 

2)  IR  sceetra.  Frequencies  v  of  the  following  stretching 
bands  were  examined: 

CO  [in  CFjCOOMe  (N),  CCl,COOMe  (0),  CH,C0CH,  (P)  and  HCONMe, 
(R)],  CM  in  n-C3K,-CN  (Q)  and  SO  in  mso  (K). 

3)  UV-Visible  Spectra.  Frequencies  of  ine  longest  wave¬ 
length  absorption  maximum  for  the  following  ccxapounds  were 
determined: 

4-Nltroanlline  (S),  N-Methyl-4-nitroanillne  (T),  N,M- 
Diaethyl-4-nitroaniline  (U ) ,  N,  N, -Diethyl-4-nitroaniline 
(V),  4-Nitrophenol  (W),  4-Nitroanisole  (X),  3,4- 

Oinitrophenol  (Y)  and  8-carotene  (Z). 

We  are  aware  of  the  fact  that,  complicating  factors  are 
present  in  sooe  of  the  IR  data,  particularly:  (1)  Fermi  resonance 
and  (ii)  features  related  to  the  various  extents  of  hydrogen¬ 
bonding  association  to  the  solvent.  In  the  case  of  the  UV- Visible 
Spectra,  the  existence  of  residual,  variable  vibrational 
structure  introduces  uncertainties  in  the  determination  of  v.,^. 

These  facts  were  not  taken  into  account,  being,  as  they  are, 
representative  of  the  actual  situations  prevailing  in  many 
studies  of  "Solvent  effects”. 

These  sets  of  data  were  examined  by  means  of  equations  (1) 
and  (2).  More  precisely: 

-  Empirical  treatments  Include  correlations  with  (re*, a,  S), 
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B,"  axid  (A,B}. 

-  PCA  ware  carried  out  with  up  to  six  conponentes. 

The  results  of  these  analyses  are  suasiarized  in  Tables  IZ- 

IV. 

Relevant  conclusions  derived  therefrom  are  as  follows: 

1}  The  Kaalet-Taft .  formalism,  even  in  its  simplest  form, 

generally  performs  better'  than  the  other  empirical  methods. 

2)  For  properties  displaying  a  significant  sensitivity  to  the 
solvent.  It*,  a  and  fi  account  in  general  for  at  least  95%  of 
the  variance.  A  case-by-case  study  of  the  various 
correlations  thusly  generated  shows  that  they  "make  good 
physical  sense”. 

3)  For  the  sme  properties,  variables  8^  to  8,  explain  the 
following  average  percentages  of  the  variance:  8^,  57.6%; 
8i  and  e,,  90%;  e^  to  e,,  95.02%;  to  8*,  96.22%;  Sj  to  e,, 
97.15%  and  8^  to  8„  98.02%.  It  is  clear  that  the 
contributions  from  84,  8^  aru3  8,  are  rather  marginal. 

4)  Most  important,  under  the  same  conditions  of  orthogonality, 
the  quality  of  the  correlations  obtained  with  (n*,  a,  S)  is 
quite  ccsBparable  to  and  often  better  than  that  obtained 
with  8j,  8,  and  8;.  This  is  remarkable  because,  indirectly, 
the  8's  account  up  to  some  extant  for  the  "specificity"  of 
aroswtic  and  polyhalogenated  solvents. 

5)  It  is  of  Interest  that,  while  an  analysis  in  terms  of  n*,  a 
and  B  can  be  visualized  in  terms  of  physical  interactions, 
this  is  not  the  case  for  the  8's.  The  latter  can  in  fact  be 
expressed  as  "blends”  of  the  Kamlet-Taft  parameters.  For 
example: 

8i  -  (-0.603  ±  0.034)  -  (0.519  ±  0.047)«*  +  (0.421  ±  0.041)0  + 

(1.691  ±  0.023}a 

r^  -  0.991;  sd  -  0.11 
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6) 


Frc»  a  praotical  standpoint,  it  would  sees  that,  for 
pradlctlva  purposaa,  if  ona  wishas  to  predict  t^ia  value 
taken  by  som  solute  property  P  in  a  given  solvent,  oxu:» 

( 1 }  several  cognate  properties  have  been  studied  in  a 
variety  of  eedia,  including  the  solvent  of  interest  and, 
(ii>  P  has  already  been  determined  in  several  of  these 
solvents,  a  PCA  treatnent  seees  quite  convenient. 

on  the  other  hand,  if  one  is  interested  in  predicting  P  in 
solvents  not  covered  by  these  preliminary  studies  and/or  if 
an  insight  into  the  aeehanisa  of  solvent*aolute  interaction 
is  being  sought,  use  of  the  Kasd.et<>Taft  foraalisa  seems 
warranted. 

7}  Last  we  notice  that  iMither  PCX  nor  the  empirical  scales 
should  be  applied  outsicto  their  range  of  physical  validity. 
This  is  the  ease  for  in  the  UV-Vislble  spectrum  of  8- 
carotene.  Here,  correlations  either  with  (n*,  a,  6)  or  (6^, 
8,  and  6])  are  poor* 

This  is  so  because,  as  sho«m  by  Abe  medium  effects  on 
this  property  are  essentially  determined  by  n,  the 
refraotive  Indes  of  the  solvent.  The  properties  esamined  in 
this  work  are  much  less  sensitive  to  changes  in  n,  and  so 
are  8^,  8,  and  8,.  The  physical  contributions  to  n*,  a  and  8 
are  well  known  end  immediately  explain  their  limited 
\isefulness  in  this  case. 
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CHART  I 
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TABLE  I 


Solvents  usad  in  the  comparison  PCA/EM 


Cyclohexane 

Triethylamlne 

Trlbutylamlne 

Tetrachlorcsaethane 

Diethyl  carbonate 

Toluene 

Acetonitrile 

2,  6-Oliiiethylpyridine 

Dlchlorcmethane 

DiflM  thy Isul foxide 

Nitrobensene 

Tert>butanol 

Butanol 

Methanol 

Ethylene  glycol 

2,  2,  2~Tri£luoroethanol 

1,  1,  1,  3,  3,  3-Hexa£luoro>2-propaz)ol 
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Raaardlna  Tables  II  and  111 

1.  Thasa  ITablas  era  taken  froa  the  original  text  of  a  paper  to 
appear  in  French  in  the  Canadian  Journal  of  Cheisiatry.  Thus, 
coauaas  stand  for  decimal  points. 

2.  Values  of  A  and  B  are  not  available  for  all  the  solvents 
studied  in  this  work.  Values  in  parentheses  are  for  correlations 
limited  to  solvents  for  which  these  values  are  available. 

3.  STB  stands  for  the  standard  deviation  of  the  correlation. 
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Table  il  .Correlations  using  NMR  Data 


rt*t  a 

.  3 

*4 

K  , 

8 

Solute  p 

t* 

ETC 

c* 

BT£ 

z* 

ETC 

z* 

ETS 

K 

S' 

0.987(0, 981) 

0,117(0,061) 

0,782 

0,424 

0,898 

0,  ISO 

0,987  , 

,1  16. 

0,960(0,871) 

0,083(0,077) 

0.6X1 

0,242 

0,745 

0,102 

0,954 

0,088 

Sc, 

0,920(0,846) 

0,109(0,116) 

0,277 

0,308 

0,752 

0,140 

0,929 

0,103 

S' 

0,982(0.955) 

0,340(0,386) 

0,950 

0,530 

0,962 

0,334 

0,393 

0,2X3 

Sc, 

0,900(0,802) 

0,411(0,388) 

0,914 

0,331 

0,877 

0,290 

0,953 

0,279 

a 

8c. 

0,984(0,961) 

0,196(0,187) 

0,902 

0,448 

0,959 

0,1S3 

0,993 

0,125 

S‘ 

0,985(0.976) 

0,284(0,269) 

0,937 

0,451 

0,979 

0,235 

0,980 

0,327 

S’ 

0,724(0,834) 

0,355(0,312) 

0,446 

0,504 

0,831 

0,299 

I  0,774 

0,346 

sc, 

0,936(0,975) 

0,219(0,201) 

0,901 

0,549 

0,930 

0,321 

1  0,963 

0,362 

c 

Sc. 

0,950(0,912) 

0,228(0,221) 

0,934 

0,245 

0,913 

0,208 

0,983 

0,134 

S’ 

0.971(0,956) 

0,096(0,098) 

I  0,861 

0,224 

0,913 

0,131 

0,970 

0,112 

fic, 

0,992(0,984) 

0,122(0,122) 

0,918 

0,372 

0,971 

0,158 

0,994 

0,107 

0 

sc, 

0,919(0,932) 

0,380(0,210) 

0,871 

0,447 

0,939 

0,190 

0,965 

0,251 

S’ 

0,974(0,958) 

0,110(0,086) 

0,797 

0,286 

0,888 

0,134 

0,977 

0,104 

Sc, 

0,973(0,987) 

0.400(0,156) 

0,857 

0,862 

0,982 

0.175 

0,988 

0,265 

e 

«C. 

0,933(0,933) 

0,376(0,283) 

0,873 

0,478 

0,936 

0,261 

0,961 

0,281 

Sc, 

0,685(0,58  )) 

0,115(0,110) 

0,057 

0,184 

0,619 

0,101 

0,736 

0,105 

SC, 

0,938(0,8*'/) 

3,313(0,181) 

0,627 

0,718 

0,830 

0,211 

0,962 

0,247 

r 

sc. 

0,960(0,924) 

0,128(0,128) 

0,949 

0,134 

0,948 

0,101 

0,990 

0,063 

Sc, 

0,981(0,947) 

0,093(0,108) 

0,741 

0,318 

0,170 

0,162 

0,979 

0,096 

fic, 

0,970(0,959) 

0,123(0.112) 

0,592 

0,438 

0,941 

0,127 

0,955 

0,156 

Q 

S‘ 

0,909(0,809) 

0,165(0,154) 

0,882 

0,174 

0,886 

0,112 

0,961 

0,107 

S’ 

0,976(0,943) 

0,103(0,100) 

0,819 

0,262 

0,899 

0,126 

0,964 

0,882 

Sc, 

0,431(0,575) 

0.560(0.458) 

0,004 

0,715 

0,651 

0,394 

0,496 

0,546 

H 

Sc, 

0,880(0,921) 

0.042(0,026) 

0,534 

0,077 

0,804 

0,039 

0.930 

0.033 

Sc, 

0,781(0,820) 

0,151(0,149} 

0,220 

0,265 

0,867 

0,122 

0,872 

0,115 

sc, 

0,826(0,883) 

0.034(0,021) 

0,376 

0,061 

0,717 

0,032 

0,926 

0,022 

«C. 

0,843(0,835) 

0,279(0,302) 

0,200 

0,587 

0,818 

0,252 

0,907 

0,215 

1 

8(C.-C.J 

0,943(0,956) 

0.016(0,081) 

0,206 

0,292 

0,902 

0,115 

0,933 

0,091 

5(C,-C.) 

0,943(0,962) 

0,119(0,109) 

0,212 

0,410 

0,927 

0,144 

0,949 

0,111 

6(C,-C,) 

0,749(0,826) 

0,091(0,080) 

0,166 

0,138 

0,102 

0,061 

0,799 

0,073 

J 

Sc, 

0,965(0,9041 

0,292(0,312) 

0,831 

0,591 

0,852 

0,366 

0,972 

0,261 

Sc, 

0.960(0,872) 

0,240(0.269) 

0.719 

0,505 

0,820 

0,301 

0,965 

0,223 

Sc, 

0,967(0,936) 

0,203(0.205) 

0,839 

0,412 

0,830 

0,315 

0,971 

0,169 

sc. 

0,197(0.303) 

0,195(0,957) 

0.011 

0,913 

0,072 

1,041 

0,195 

0,899 

K 

Sc  1 

0,919(0,658) 

0,324(0.329) 

0,955 

0,225 

0,922 

0,232 

0,942 

0,275 

U 

5p 

0,944(0,904) 

0,370(0,323) 

0,785 

0,652 

0,824 

0,416 

0,948 

0,371 

M 

Sp 

0.987(0,977) 

0,582(0,663) 

0,926 

1,300 

0,977 

0,613 

0,989 

0,932 

Tab  lei  1 1  . Correlations  using  IR  and  UV-Visible  Data 


« 

a  •  P 

A  , 

a 

6i,  02 

/  03 

solut*  P 

r» 

WBM 

£TE 

r» 

CTE 

c' 

ETE 

N 

0,»4fi(0,d»4) 

1,222(0,243) 

0,(04 

3,071 

0,  £(1 

1,731 

0,98( 

0,(21 

0 

0,930(0,8(3) 

1,575(1,307) 

0,(09 

3,279 

0,(30 

2,027 

0,960 

1,126 

p 

0,»S» (0,9(0) 

1,242(1.031) 

0,925 

1,54£ 

0,971 

0,835 

0,971 

1,038 

R 

0,9‘?2(Q.9S0) 

2,3(2 (2,503 

0,921 

3,(21 

0,5(6 

1,947 

0,976 

2,172 

K 

0,935(0,954) 

7, (82 ((,355) 

0,8(( 

10,l(e 

0,915 

8,192 

0,9(5 

5,(39 

Q 

0,994(0,991) 

0,449(0,381) 

0,538 

4,054 

0,923 

1.078 

0,992 

0,592 

S 

0,914(0,990) 

0,218(0,202) 

0,449 

1, 18( 

0,732 

0,989 

0,974 

0,275 

T 

0,9(3(0,984} 

0,297(0,192) 

0,752 

0,713 

0,8(5 

0,530 

0,971 

0,2(5 

U 

0,954(0,970 

0,328(0,202) 

0,7(7 

CO 

o 

0,988 

0,133 

0,984 

0,191 

V 

0,959(0,990) 

0,303(0,124) 

0,779 

0,(S5 

0.9e( 

0,139 

0,986 

0,178 

w 

0,895(0,9(9) 

0,3(9(0,241) 

0,271 

0,699 

0,559 

0,a5( 

0,612 

0,493 

X 

0,950(0,9(9) 

0,214(0,151) 

0,(73 

0,505 

0.973 

0,133 

0,976 

0,148 

Y 

0,90(0,9(9) 

0,380(0,390 

0,407 

1,352 

0,757 

1,031 

0,927 

0,530 

z 

0,745(0,(91) 

0,192(0,180 

0,198 

0,315 

0.  (41 

0,189 

0,693 

0,211 

Sr 

0,953(0,907) 

2,500(3,233) 

0,974 

1,613 

0,9(8 

0,214 

II)  "Some  commenta  on  "polarigabllity  effgfifea. 


The  starting  point  of  this  discussion  Is  an  important  study 
by  Laurence,  Nlcolet  and  Relchardt*^’ ,  dealing  with  the  comparison 
of  medium  effects  on  E,  and  n*.  Examined  therein  were  a  highly 
lipophilic  betaine  (A)  and  4>nltroanl80le  (b)  In  a  variety  of 
saturated  cyclic  and  acyclic  hydrocarbons,  perfluorinated 
solvents  and  in  the  gas^phase  [the  range  of  media  was  narrower 
for  (A)]. 

An  extremely  important  finding  was  that,  while  n*  shows  a 
strong  dependence  and  varies  linearly  with  f(n  )  -  (n"-i)/(2n**l) 
over  the  entire  range  of  values  for  n,  R,  la  essentially 
Independent  of  f(n). 

On  the  other  hand,  it  is  known**’  that  over  the  range  c-CgHu 
•  DMSO,  £,  and  (n*  •  dd)  for  non  hydrogen^bond  donor  solvents  are 
linearly  related.  It  seems,  therefore,  that  If  some  property  is 
well  correlated  by  x*  [or  (n*  -  dd)]  over  the  range  c-CgHj,  -  DMSO 
(0SR*S1.0)  one  cannot  safely  conclude  that  such  a  correlation 
will  still  hold  for  negative  n*  values,  (l.e..  In  the  range  gas- 
phase  -c-CtH^s ) . 


Use  of  the  simplified  form  of  McRae’s  equation  [equation 
(1)]  shows  why  It  Is  so: 

a’-constant+MC  ( n*-X )  /  ( 2n*+l )  ] +N{  [  ( €-1 )  /  ( 2€+l )  ]  ”  [  ( n*-l )  /  ( 2n*+l )  ] } 

In  general,  in  the  range  c-C,Hi,  -  DMSO,  a*  values  largely  reflect 
the  Importance  of  the  last  term  In  eq.  (1).  Over  the  range  gas- 
phase  -c-C,Hj2,  tills  term  Is  generally  small  or  nil  and  changes 
In  n*  essentially  reflect  changes  In  the  refractive  Index  of  the 
medium.  Now,  for  medium  effects  or  any  property  P  to  be  lenearly 
related  to  n*  over  the  entire  range  gas-phase  -  DMSO,  the  ratio 
N/M  has  to  be  the  same  for  P  as  for  n*. 

"A  priori",  there  is  no  reason  for  this  behavior  to  be 
general . 
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I  presented  two  esanples  os  properties  showing  a  very 
different  response  to  changes  in  the  refractive  iiuiex  of  the 
medium.  These  properties  are  known  to  satisfactorily  correlate 
with  (h'-dS)  for  Ti*>0. 

Differential  free-energies  of  activation  for  the 
quatemization  of  EtsN  with  Mel  are  seen  to  be  extremely 
sensitive  to  changes  in  n,  as  well  as  to  more  specific  "aromatic" 
effects^^’.  Fig.  1.  it  is  likely  that  some  contributions  from 
cavity  effects  are  also  embodied  in  this  linear  relationship. 

Standard  free-energy  changes  A6*  for  the  enolization  of 
acetylacetone,  equation  (2): 

CHjCOCHjCOCHj  t5=======±  CHjCOHCHCOCHj  (2) 

have  been  determined  in  the  gas-phase^*’  and  in  a  variety  of 
solvents*’* . 

He  were  stricken  by  the  fact  that  the  value  for  AG*  in  the 
gas^phase  was  nearly  the  sane  (within  the  limits  of  experimental 
error)  than  that  in  c-C,Kx2  solution. 

He  are  presently  exploring  this  reaction  in  solvents  of  low 
or  nil  dlpolarlty  having  widely  different  refractive  Indexes.  I 
presented  preliminary  results  in  perfluorodecalln,  n>heptane, 
cis.decalln,  quadrlcyclane  and  trans,  trans,  cis>  1,  S,  9- 
cyelododecatrlene.  It  was  clear  that  contributions  other  than 
dipolar  or  multipolar  effects  (case  of  benzene)  play  a  very  minor 
role  (if  any)  on  the  position  of  equilibrium  (2). 

It  thus  seems  that  good  correlations  with  n*  or(ii*-dd)  for 
n*20  do  not  necessarily  imply  any  such  correlation  for  k*<0. 
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FORMATION  CONSTANTS  IN  C-H  HYDROGEN  BONDING 


John  P.  Lorand 

Ma?.colm  H.  Filsun  Laboratories,  Department  of  Chemistry 
Central  Michigan  Univerrity 
Mt.  Pleasant,  MI  48859 


Formation  constants  for  Hydrogen  Bond  (H-bond)  complexes  of  a  wide 
variety  of  C-H  donors  with  HMPA  in  CCI4  at  35®  have  been  measured  by  a  pmr 
method.  The  donors  fall  into  classes  in  which  the  C  atom  hybridization  is  sp^, 
sp^  (ethylenes  as  well  as  substituted  benzenes),  and  sp  (terminal  alkynes).  In 
several  cases  where  constants  were  too  small  to  measure  in  CCI4,  cydohexane  as 
solvent  gave  measurable  K  values. 

Proton  chemical  shifts  for  series  of  solutions  were  analyzed  via  the  Higuchi 
equation,^  which  is  presented  in  viewgraph  1,  together  with  a  description  of  its 
use.  This  equation  is  derived  with  no  approximations  for  1:1  complexes.  No 
analogous  equation  has  been  derived  for  2:1  complexes,  (HA)2'B. 

As  the  examples  in  viewgraph  2  show,  plots  are  invariably  linear  when  the 
donor  contains  only  one  “addic”  proton,  l^en  two  or  more  equivalent  ^addic”- 
protons  are  present,  as  shown  in  viewgraphs  3  and  4,  the  Higuchi  plot  may  be 
concave  downward,  particularly  at  higher  base  concentrations.  This  curvature 
generally  disappears  at  lower  [HMPA],  the  range  depending  on  the  strength  of  the 
proton  donor.  Examples  are  fumaronitrile,  malononitrile,  and  the  1, 2,3,5-  and 
1,2,4,5-tetrahalobenzenes,  and  1,3,5-trihalQbenzenes.  K  values  are  here  reported 
only  for  low  concentrations,  and  have  been  statistically  corrected  by  dividing  by  the 
number  of  equivalent  H’s,  giving  “K/IT,  when  two  or  more  are  present. 

Data  for  C(sp  )-H  donors  appear  in  viewgraphs  5  and  6.  Several 
observations  emerge: 

(1)  The  heavy  halogens  are  about  equally  effective  at  fadlitating  H-bonding 
when  present  on  the  a-Carbon. 

(2)  F  is  less  effective  than  the  other  halogens  when  on  the  a-Carbon,  but 
more  effective  when  on  the  p-Carbon. 

(3)  The  cyano  group  is  c.  8  times  as  effective  as  a  heavy  halogen,  but  the 
nitro  group  only  c.  5  times  as  effective,  in  the  dihaloacetonitriles.  When,  however, 
two  of  each  group  are  present,  cyano  and  nitro  are  equally  effective.  The  largest  K 

value  measured  thus  far  is  205  M*^,  for  bromomalononitrile^  (not  shown  in 
viewgraphs). 

(4)  Carbonyl  groups  are  less  effective  than  halogens,  and  methoxy  is 
entirely  ineffective. 
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(6)  Viewgraph  7  shows  a  tendency  for  K  to  increase  with  the  sum  of  the 
inductive/held  substituent  constants,  but  with  considerable  scatter.  The  scatter 
can  be  explained  only  in  part  by  steric  effects.  The  order  of  K*8  is  somewhat 
scrambled  as  compared  to  that  for  substituent  constants. 

(6)  K  is  measurable  for  diphenylacetonitrile,  though  it  contains  only  one 
significantly  polar  fimctional  group. 

(7)  Shifts  of  the  C-H  stretching  band  in  the  IR,  -Av  (cm*l),  measured  in  the 
presence  of  DMSO-de,*^  are  almost  entirely  uncorrelated  with  K  values. 
Viewgraph  8  shows  this  plainly.  Difiuoroacetonitrile  and  phenyldinitromethane 
show  no  shift  at  all;  however,  their  C-H  stretching  bands  both  increase  in 
intensity. 

(8)  Substituent  effects  observed  for  Br  and  CN  can  be  used  to  predict  ICs  for 
methyl  bromide  and  for  acetonitrile.  K/H  for  the  latter  is  significant  and  should  be 
meastu^ble.  Both  values  can  in  turn  be  extrapolated  to  essentially  the  same  value, 
0.007,  for  K/H  for  methane.  This  might  suggest  that  methane  undergoes  H- 
bonding  with  HMPA.  However,  the  K/H  value  is  an  order  of  magnitude  below  the 
minimum  found  by  Abraham®  for  a  wide  variety  of  donors  and  acceptors. 
Therefore,  any  interaction  that  may  occur  between  methane  and  HMPA  is 
probably  not  an  H-bond. 

K’s  for  several  alkenes  with  HMPA  appear  in  viewgraph  9.  Again,  the 
cyano  group  is  much  more  effective  than  halogen;  cyano  and  nitro  are  about 

equally  effective;  ^-cyano  as  at  least  as  effective  as  a-cyano;  and  IR  ftequency  shifts 
are  nearly  independent  of  K 

K’s  for  several  terminal  alkynes  with  HMPA  are  shown  in  viewgraph  10. 
The  three  smallest  values  were  measured  in  cyclohexane,  while  the  three  largest, 
measured  in  CCI4,  are  calculated  using  the  fact  that  in  general 
k(cyclohexane)/k(CCl4)  ®  4.  The  Ks  cover  a  wide  range,  cyano  again  giving  the 
largest  value.  The  order  of  K’s  follows  the  order  of  substituent  constants  except  for 
ethoxy.  Most  likely  the  inductive/field  electron  withdrawing  effect  of  ethoxy  is 
coiinteracted  by  n-electron  donation  to  the  triple  bond.  All  the  IR  frequency  shifts 
are  large,  and  they  show  more  correlation  wifti  K’s  than  do  those  for  alkenes. 

K/H  values  for  series  of  polyhalobenzenes  appear  in  viewgraph  11.  Many  of 
these  were  measured  in  cyclohexane  because  of  their  small  magnitudes  in  CCI4. 
However,  most  values  for  polyfluorobenzenes  had  to  be  measured  in  CCI4  because 
the  fluorines  split  their  signals  sufficiently  to  render  them  invisible  by  the  FT 
technique  in  the  presence  of  the  very  strong  cyclohexane  signal. 

For  polyfluorobenzenes,  the  more  P  atoms,  the  larger  is  K/H.  Comparison  of 
tetrafiuorobenzenes  with  pentafluorobenzene  shows  that  F  has  the  largest  effect  at 
the  meta  position,  the  smallest  at  the  ortho  position.  Polychlorobenzenes  behave 
similarly;  however,  K/H  is  larger  for  1,2,3,4-tetrachlorobenzene  than  for 
pentachlorobenzene,  rather  than  smaller,  and  K/H  for  H-5  of  1,2,3- 
trichlorobenzene  is  larger  still.  K/H  values  tend  to  fall  in  the  order  P  >  Cl  >  Br,  but 
values  for  1,3,5-trihalobenzenes  are  indistinguishable,  and  so  are  those  for  1,4- 
dihalobenzenes.  A  value  of  K  for  2,4,6-tribromo-m-xylene  could  not  be  measured, 
at  least  in  CCI4,  because  HMPA  did  not  increase  the  chemical  shift. 

These  observations  imply  that  the  sizes  of  halogen  atoms  are  important  in 
determining  K/H.  The  principal  effect  is  “buttressing,  the  deformation  of  C-C-X 
and  even  C-C-C  bond  angles  by  two  or  more  larger  halogens  in  consecutive 
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positions  on  the  ring.  Thus,  it  is  known  firom  crystallographic  data  that  not  only 
hexachloro-*  but  also  pentachlorobenzene^  are  non-planar,  both  the  ring  carbons 
and  the  chlorine  atoms  lying  either  above  or  below  an  “average’*  plane. 
Hexafluorobenzene*  is,  however,  acctu'ately  planar.  Replacing  Cl-1  in 
pentachlorobenzene  by  H  to  form  l,2,3,4<tetracblorob6i\zene  both  relieves  part  of 
this  deformation  and  makes  each  H  more  accessible  to  an  HMPA  molecule.  The 
l,3,S'trihalobenzen68  have  no  buttressing,  so  their  K/H  values  reflect  the  intrinsic 
substituent  effects  of  the  halogens.  We  may  surmise  that  these  are 
indistinguishable-as  is  consistent  with  their  similar  inductive/field  constants. 
Buttressing  in  2,4,6-tribromo-m-xylene  must  be  severe,  and  K  may  well  be  smaller 
than  that  of  pentabromobenzene. 

The  effects  of  replacing  halogen  atoms  in  particular  locations  with  H  atoms 
may  be  calculated  and  used  to  predict  a  value  of  K/H  for  benzene  (cf.  viewgraph  12). 
The  polyfluorobenzenes  give  a  value  of  c.  0.05  M'^,  while  the  polychlorobenzenes 
give  c.  0.25  M'^,  both  in  cyclohexane.  An  attempt  to  measure  K/H  for  benzene  in 
cyclohexane  gave  a  fairly  good  Higuchi  plot  and  a  K/H  value  of  c.  0.05  M'^,  in  good 
agreement  with  prediction  for  the  polyfluoro  series.  Presxunably  the  substituent 
effects  in  the  polychloro  series,  determined  for  several  compounds  experiencing 
buttressing,  are  different  from  those  which  apply  in  the  absence  of  buttressing.  It 
is  notewoi^y  that  our  rough  value  for  benzene  falls  near  the  Abraham  borderline, 
and  it  is  desirable  to  seek  independent  evidence  as  to  whether  benzene  indeed  does 
or  does  not  represent  a  second  unsubstituted  hydrocarbon,  aO^r  terminal  alkynes, 
which  engages  in  H>bonding. 

In  conclusion,  this  work  has  shown  that  ICs  for  H-bonding  by  a  variety  of 
compounds  with  HMPA  in  either  CCI4  or  cyclohexane  can  be  measured  by  pmr, 
that  several  polar  substituents  are  slightly  to  very  elective  at  facilitating  H-bond 
formation;  that  ICs  overall  correlate  very  poorly  with  -Av  from  IR  spectra;  that  ICs 
correlate  fairly  well  with  inductive/freld  substituent  constants,  with  a  few 
exceptions;  and  that  benzene»and  less  likely,  methane-may  engage  in  H-bonding 
with  strong  acceptors.  It  had  already  been  shown  by  IR,  mainly  by  Allerhand  and 
Schleyer,^  that  H-bonding  by  C-H  bonds  is  more  common  than  had  been  supposed. 
Given  the  proper  tools,  it  should  be  possible  to  show  that  the  C-H  bonds  of 
acetonitrile,  nitromethane,  and  many  other  compounds  can  also  form  H-bonds. 
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MEASUREMENT  of  K  by  PMR 
THE  HIGUCHI  EQUATION 
Derived  for:  A(cid)  +  B(ase)  =  C(omplex), 

assuming  1:1  complexes  only: 


Cb  Ca  +  Cb  •“  Cc 

1 

Sobs  Sa  6c  —  6a 

• 

K(5c  -  6a) 

C’s  =  total  concentrations  (M) 

5’s  =  chemical  shifts 

Procedure: 

1)  Plot  Left  side  vs.  Ca  +  Cb: 

calculate  slope 

2)  Calculate  Cc  =  ; 

Oc  Oa 

replot  vs.  Ca  +  Cb  -  Cc  &  recalc,  slope 

3)  Repeat  (2)  until  slope  converges 

4)  Reject  deviant  points  &  repeat 

5)  K  =  slope/intercept 


53 


C(sp3)-H  DONORS  with  HMPA  in  CCI4 


DONOR  K  (M-1)  -Av  (crn'l) 


HCC13 

2.3 

29 

HCBrs 

2.0 

50 

HCI3 

2.8 

HCF2CN 

10.7 

0 

HCCI2CN 

20.6 

66 

HCBr2CN 

16.5 

80 

HCBr2N02 

10.3 

80 

HCCl2CONMe2 

1.44 

HCCl2COMe 

0.95 

HCCl2C02Me 

0.68 

26 

HCCl20Me 

c  0.3 

HCBr2CBr3 

1.1 

58 

HCBrClCFa 

2.5 

K:  CN  >  NO2  >  Cl,  Br,  I,  CF3  >  F,  COX,  CBrs  >  OMe 
a:  NO2  >  CN  >  F  >  Cl,  CF3,  Br  >  I  >  COX,  OMe 
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MOSTLY  DISUBSTITUTED  C(sp3)-H  DONORS 


w.  HMPA 

in  CCI4 

DONOR 

K,  M-i 

K/H,  M-1 

CH2Br2 

0.68 

0.34 

PhCHBr2 

0.68 

0.68 

p-02N-C6H4-CHBr2 

5.5 

5.5 

CICH2CN 

4.6 

2.3 

BrCH2CN 

7.1 

3.5 

ICH2CN 

6.5 

3.2 

Ph2CHCN 

1.0 

1.0 

CH2(SCN)2 

6.5 

3.3 

CH2(CN)2 

60. 

30. 

With  Acetone: 

CH2(CN)2 

3.39 

1.7 

Ph-CH(CN)2 

1.70 

1.7 

Ph-CH(N02)2 

1.69 

1.7 

With  HMPA,  . 
Estimated: 

n  05 

CHsCN 

0.43 

CH4 

0.007 

K;  CN  >  NO2  >  SCN  ~  P-O2N-C6H4  >  F,  Cl,  Br  >  CONMe2> 

COMe  >  C02Me,  Ph  >  OMe 

a:  N02>CN>F>C1,  CFs,  Br>I>C02Me,  COMe,  OMe  > 

CONH2  >  Ph 
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7 


Br-CH{CN)2 


I  i-2- 


CH5(CN)9  °P‘‘-CH(N02)2 

^oC!2CH.CN 

Br2CH-CN 

o  0F2CH.CN 
Uf2CH-N02 


0.4-  CH2(SCN)2  C4CH2CN  ^  * 

C02E»  CHB.3 

0.0-  Ph-CH-CN®  oBf3C.CHBr2 

Ph-c1iB,2  °CI2CH.C02M. 

-0.4- 

_  0CH2Bt2 

0.6  0.8  ,•  1.0  1.2  1.4  1.6  1.3 

Erj 

Figure  4.  Relation  between  K’s  and  sum  of  Taft  ot  values  (cf.  ref  18). 


Br.CH(CN)2«> 


l.S-Aph.CH(N02)2 


^  1 

?  l.QJp 


g-  1.0^F2CH-CN 


CH2(CN)2 

o 

CI2CH.CN  oBr2CH-CN 
Br2CH-N02 


CH2(SCN)2 

0  0-  oBr3C-CHBr2 

0Cl2CH-CO2Me  of 


-O.SH  oCH2Br2 


0CI2C-CHCI 

1  f 

CClz 


0  20  40  60  80  100  120 

Figure  3.  Relation  between  donor-HMPA  formation  constants  (/<’s) 
and  infrared  C-H  stretching  wavenumber  shifts. 
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C(sp2)-H  DONORS  w.  HMPA  in  CCI4 

ALKENES 


DONOR 

K/H  (M*l) 

-Av  (cm‘^) 

Cl2C=CH.Cl 

0.27 

41 

Rr2C=CH-Br 

0.23 

40 

Me2C=:CH-CN 

0.26 

E-PhCH«:CH-CN 

1.24 

• 

E-PhCH=:CH-N02 

1.06 

E  NC-CHsCH-CN 

7.2 

45 

Z-NC-CH=CH-CN 

6.2 

(NC)2C=CH-OEt 

18. 

55 

CN  -  NO2  »  X 
CN  -  X3 
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C(sp)-H  DONORS  w.  HMPA  in 
CYCLOHEXANE: 
ALKYNES 


io 


DONOR  K  (M-l)  .Av  (cm-1) 


CD 

C 

1 

O 

III 

o 

1 

I 

0.44 

82 

EtO-C=C-H 

0.60 

I 

1 

o 

III 

o 

! 

x: 

o. 

1.00 

105 

BrCH2~C=C— H 

2.12 

102 

EtOaC-CsC— H 

6.6 

NC-CSC-H 

• 

00 

155 

K:  NC-  >  .Et02C-  >  BrCH2-  r>  Ph-  >  EtO-  >  R- 

a:  NC-  >  Et02C-  >  EtO-  >  BrCH2-  >  Ph-  >  R- 

K  for  EtO-  is  out  of  place. 
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C(sp2)-H  DONORS  +  HMPA  in  CYCLOHEXANE 

AROMATICS 


DONOR 


H 


H 


IC/H  (M*l) 


X=:F 

Xr:  Cl 

X  =  Br 

2.0 

0.77 

-0.1 

1.5 

1.05 

0.55 

0.38 

1.3 

0.53 

0.57 

0.22 

0.21 

0.23 

toosinaJl 
to  measure 

H-4,6 

0.60 

H-5 

1.5 

0.18 

0.23 
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C(sp2)-H  DONORS  w.  HMPA  in 
CYCLOHEXANE: 
AROMATICS,  Cont’d. 


MLOVTI 


DONOR  X=:  F  X  =  Cl 

Benzene~measured  -0.05  -0.05 


Benzene-extrapolated 


0.05  0.25 


Steric  crowding  is  important: 

1)  F  >  Cl  >  Br,  except  for  1,3,5-Tri-  &  1,4-Di-; 

2)  CeHCls  <  l,2,3,4-C6H2Cl4; 

3)  2,4,6-Tri-Br-m-xylene  too  small  to  measure; 

4)  K/H  for  Benzene  extrapolated  from  poly-Cl- 

benzenes  disagrees  w.  measured  value. 


63 


BLAHK 


64 


Using  Th«oretkal  Descriptors  in  Quantitative 
Structure  Activity  Relationships 

George  R.  Famini  and  Leland  Y.  Wilson,  U.S.  Army  Chemical  Research, 
Development  and  Engineering  Cerder,  Aberdeen  Proving  Ground,  MD,  SlOlO,  and 
Loma  Linda  University,  Riverside,  CA,  98515 


1.  Introduction 

Quantitative  Structure  Activity  Relatiorwhips  (QSAR)  have  been  used  extensively 
in  correlating  structural  features  to  physic2il,  biological  and  toxicological  properties. 
The  basic  tenet  of  QSAR  is  that  there  is  a  connection  between  the  microscopic 
(molecular  structure)  and  the  macroscopic  (empiricad)  properties.  Further,  this 
connection  can  be  used  to  predict  empirical  properties  directly  from  the  molecular 
structure.  This  relationship  was  first  quantized  by  Hammett,  who  developed  the 
Linear  Free  Energy  Relationship  (LFER). 

IJ  Linear  Solvation  Energy  Relationships 

Based  on  the  concept  of  Hammett,  Kamiet  and  Taft  developed  a  methodology  for 
developing  Free  Energy  Relationships  based  on  solute/solvent  interactions.  This 
relationship,  defined  as  the  General  Linear  Solvation  Energy  Relationship,  is  shown  in 
equation  1. 

LOG  Property  =  Steric  -f  Polarizability  +  Hydrogen  Bonding  (1) 

In  this  way,  a  given  property  can  be  described  as  a  linear  expression  consisting  of 
contributions  due  to  steric  factors,  polarization  and  polarizability  factors,  and 
hydrogen  bonding  factors.  In  the  multiple  solute  •  single  solvent  system,  the  specific 
terms  are:  Steric-  Molar  Volume  Polarizability-  spectroscopically  determined 

polarizability  (tt*),  Hydrogen  Bonding-  spectroscopically  determined  zu:idity  and 
basicity  terms  (q  and  0),  In  practice,  not  all  four  terms  are  required  for  every 
relationship.  The  coefficients  and  associated  statistical  t-scores  can  be  used  to  gauge 
the  importance  of  each  descriptor  for  every  property  correlation.  In  this  way  the 
LSER  can  be  used  to  infer  insight  to  solute/solvent  interactions. 

One  major  difficulty  of  this  approach  has  been  the  nature  of  the  descriptors.  All 
are  empirically  determined,  therefore  reducing  the  usefulness  of  this  approach  for  a 
priori  predictions.  Some  attempts  have  been  made  correlating  more  fundamental 
structural  and  electronic  descriptors  with  the  Kamlet-Tadt  solvatochromic 
parameters  with  moderate  degrees  of  success.  ® 
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1.2  Applications  of  Theoretical  Chemistry 

Theoretical  chemistry  has  been  used  in  the  past  to  supply  structural  and  electronic 
descriptors  for  QSAR  and  QSAR-like  equations.  In  this  way,  empirically  derived 
descriptors  can  be  replaced  in  relationships  such  as  the  LSER  with  descriptors  that 
will  make  the  relationships  more  generally  applicable.  The  Theoretical  Linear 
Solvation  Energy  Relationship  (TLSER)  is  su.ch  a  derivation,  using  the  LSER 
philosophy  and  general  structure,  but  replacing  the  empirically  derived  descriptors 
with  computationally  derived  descriptors. 

2.  Calculatiooai  Procedures 

All  geometries  were  optimized  using  the  MNDO  algorithm  within  MOP  AC. 
The  molecular  voliimes  were  generated  from  the  optimized  geometries  using  the 
method  of  Hopfmger,  as  incorporated  in  the  U.S.  Army  developed  molecular 
modeling  package  MMADS.®-®  All  experimental  data  and  LSER  parameters  were 
taken  from  works  of  the  original  authors. 

3.  The  TLSER  Descriptors 

The  TLSER  descriptors  have  been  developed  with  two  main  goals  in  mind. 
First,  the  TLSER  descriptors  should  correlate  optimally  with  the  LSER  parameters. 
Second,  the  property  correlation  equations  with  the  TLSER  should  yield  correlation 
coefficients  and  standard  deviations  as  accurate  as  the  LSER.  Further,  the  TLSER 
descriptors  should  be  as  generally  applicable  to  any  solute/solvent  interaction  as  are 
the  LSER  descriptors. 

The  TLSER  descriptors  that  have  been  developed  follow  that  of  the  LSER,  and 
fit  into  one  of  the  categories  listed  in  equation  1.  The  steric  term  for  the  TLSER  is 
the  molecular  Van  der  Weed’s  volume  Volume  calculations  of  this  type  are 

standard  in  most  molecular  modeling  packages  available  today.  As  would  be 
expected,  correlation  between  V^^^  and  V^^^^  is  high,  with  a  correlation  coefficient  of 
0.979  and  a  standard  error  of  the  estimate  of  5  ml/mol. 

The  polarizability  term  used  in  the  TLSER  is  determined  from  the  method  of 
Stewart  and  Dewar  incorporated  in  MOPAC  Dividing  the  resulting  polarizability 
by  the  V^^.  results  in  a  size  independent  Polarizability  Index  (tTj).  This  term  defines 
the  eases  in  which  the  electron  cloud  can  be  moved  or  polarized.  Aromatics  would 
rank  high  on  the  scale,  and  alkanes  low.  In  addition,  the  tTj  is  inversely  proportional 
to  the  electronegativity. 

Like  the  LSER,  the  hydrogen  bonding  term  is  separated  into  acid  (acceptor)  and 
and  base  (donor)  terms.  Furthermore,  because  the  hydrogen  bond  (or  the  Lewis 
bond  or  in  essence  any  bond)  can  be  separated  into  covalent  and  electrostatic  parts, 
individual  descriptors  <ire  needed  for  the  TLSER  to  describe  this.  The  energies  of 
the  highest  occupied  molecular  orbital  (representing  the  basicity,  (s^)  and  the  lowest 
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unoccupied  molecular  orbital  (representing  the  acidity,  axe  used  to  describe  the 
covalent  interactions.  Similarly,  the  electrostatic  portions  of  the  basicity  and  acidity 
are  represented  by  selected  atomic  formal  charges  in  the  molecule,  either  the  most 
negative  (for  the  basicity,  q^)  or  the  most  positive  hydrogen  (for  the  acidity,  q_p. 
The  basicity  terms  (cj  and  q  )  correlate  highly  with  the  LSER  term,  with  a 
correlation  coefficient  of  0.9518  and  a  standard  error  of  the  estimate  of  0.09. 

The  final  generedized  TLSER  equation,  Is  shown  below: 

LOG  P  +P^i  +a'qH  _|^  +  bej^  +b'q_ 

The  relationship  is  similar  in  appearance  to  the  LSER,  emd  contains  all  of  the  key 
points  that  have  m.ide  the  LSER  successful. 


4.  Representative  Correlations 

Similar  to  the  LSER,  all  six  TLSER  descriptors  are  seldom  required  or  are 
statistically  significant  for  a  given  solute/solvent  property.  Generally,  the  equations 
reduce  to  a  three  or  four  descriptor  correlation.  Table  1  shows  a  representative  set 
of  correlations  using  the  TLSER  descriptors.  In  each  case,  the  descriptors  that 

were  significant  at  the  95%  confidence  level  were  included,  and  all  others  dropped 
from  +he  correlations.  An  in  depth  treatment  of  this  technique  is  given  in  a  recent 
journal  article,  and  several  technical  reports,  The  next  paper  will  describe  some 
correlations  completed  in  depth  pertaining  to  selected  physical  properties. 

Table  1.  TLSER  Correlations 


Property 

me 

«b 

q. 

Esn 

N 

R 

SD 

Kow 

2.995 

-0.847 

1.730 

n/a 

-5.415 

n/a 

-3.960 

64 

0.957 

0.357 

LCjq 

-0.928 

-10.557 

-1.442 

n/a 

0.443 

n/a 

18.082 

32 

0.943 

0.574 

EC^ 

-4.067 

-4.147 

3.902 

n/a 

-2.777 

n/a 

11.356 

25 

0.982 

0.332 

EA 

n/a 

-2444 

n/a 

3540 

1430 

-383 

20545 

23 

0.951 

308 

1.185 

-0.610 

n/a 

n/a 

-2.290 

-1.077 

-0.083 

19 

0.986 

0.073 

^OH 

-97.7  i 

. .  j 

874 

1256 

n/a 

n/a 

n/a 

-2184 

.10 

0.986 

26.9 

C(Tadpole) 

-2.13 

-4.S91 

-4.980 

n/a 

5.00 

n/a 

11.99 

39 

0.979 

0.237 

j  C(Golc;  Orfe) 

2-2.64 

94.885 

n/a 

n/a 

4.15 

-1.77 

7.55 

0.237 

67 


5.  Conclusions 

The  diversity  of  the  properties  successfully  correlated  with  the  TLSER  show  the 
general  applicability  of  both  the  LSER  and  the  application  of  theoretical  techniques 
to  the  LSER.  In  addition,  the  TLSER  descriptors,  based  ^lely  on  theoretically 
derived  and  determined  parameters,  result  in  as  good  a  correlation  as  the  LSER 
descriptors.  Using  these  equations,  then,  solute/solvent  interactions  can  be  related  to 
fundamental  descriptions  (structural  and  electronic)  of  the  molecule.  Furthermore, 
using  the  TLSER  descriptors,  a  priori  prediction  of  solute/solvent  properties  can  be 
made  using  these  equations. 
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USING  THEORETICAL  DESCRIPTORS  IN  QUANTITATIVE 
STRUCTURE  ACnVITY  RELATIONSHIPS;  SOME  PHYSICOCHEMICAL  PROPERTIES 

George  R.  Famini,  Carl  A.  Pensld  and  Leland  Y.  Wilson* 

U.S.  Army  Chemical  Research,  Development  and  Engineering 
Center, Aljerdeen  Proving  Ground,  MD  21010  and  •Department  of 
Chemistry,  La  Sierra  Umversity  Riverside,  Riverside,  CA  92515 


lotroductioQ: 

Quantitative  Structure  Activity  Relationships  (QSAR)  have  been  used  extensively  to  corre¬ 
late  particular  structural  features  of  compounds  with  their  known  biological  properties  (activi¬ 
ties).  For  a  selected  property,  an  equation  is  obtained  that  permits  tiie  prediction  of  biological 
activities  of  compounds  analogous  to  the  compounds  in  the  data  set'.  Often  the  chosen  structural 
parameters  (descriptors)  have  been  limited  in  their  ability  to  account  for  anything  but  local  data 
sets.  QSAR,  as  develo]^  by  Hansch,  has  been  greatly  modified  by  several  researchers  to  in¬ 
crease  its  ability  to  correlate  properties^"’.  Kamlet  and  Thft  developed  an  empirical  parameter 
set,  called  the  solvatochromic  parameters,  for  use  in  Linear  Solvation  Energy  Relationships 
(LSER).  These  LSER  parameters  have  b^n  very  successfiil  in  correlating  a  wide  range  of  bio¬ 
logical  activities;  moreover,  a  strong  point  of  these  descriptors  is  that  they  also  correlate  a  wide 
range  of  chemical  and  physical  properties  involving  solute/solvent  interactions 


Theoretical  Linear  Solvation  Energy  Relationships: 

Based  on  the  LSER  concept,  a  new  set  of  parameters  for  correlating  a  wide  variety  of 
properties  has  been  developed  Tfermed  the  Theoretical  Linear  Solvation  Energy  Relationship 
(TLSER)  descriptors,  these  parameters  are  determined  solely  from  computational  methods  thus 
permitting  a  priori  prediction  of  properties.  As  indicated  by  George  (Famini)  in  the  previous 
presentation  there  are  six  TLSER  descriptors:  molecular  volume,  V^,;  polarizability  index,  Xj; 
covalent  basicity,  electrostatic  basicity,  q^;  covalent  acidity,  e  ;  and  electrostatic  acidity, 
q^^.  The  last  four  descriptors  describe  the  covalent  and  electrostatic  contributions  to  the  hydro¬ 
gen  bonding  basicity  and  acidity.  Increasing  and  e^b  values  indicate  decreasing  acidity  and 
basicity,  respectively.  The  parameters  are  scaled  by  powers  of  ten  for  convenience  in  presenta¬ 
tion  and  comparison  of  coefficients.  The  generalized  equation  is 

log  P  =  aV^  -l-bxj-h  C€^-t-  d<L+  ec,+  fq^^-t-  g 

where  P  represents  the  property.  For  a  given  property,  the  coefficients,  a  through  g,  are  deter¬ 
mined  by  fitting  data  for  a  set  of  compounds.  Often  not  all  terms  are  needed. 

Thble  2  shows  the  results  of  the  application  o^  the  TLSER  parameters  to  some  biological 
activities  from  a  previous  paper  Ikble  1  give  the  results  for  the  corresponding  LSER  correla¬ 
tions.  Both  give  good  correlations  and  have  physically  reasonable  interpretations.  The  descrip¬ 
tors  have  very  small  cross  correlation  in  these  data  sets. 

Properties  in  this  presentation  include  pKa’’*,  charcoal  adsorption®,  HPLC  retention 
index'*’,  electronic  absorption  of  a  pyridinium  yiide“,  Kow  (octanol- water  potion 
coefficient)*^,  and  the  rate  constant  for  the  hydrolysis  of  organophosphonothioates*’. 
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Calculational  Procedure: 

Molecular  geometries  were  optimized  and  TLSER  descriptors  were  calculated  using  the 
MNDO  algorithm  contained  in  MOPAC‘^*‘^.  The  molecular  volume  for  the  optimized  geomet^ 
was  determined  using  the  algorithm  of  Hopfinger^^.  The  in  house  developed  molecular  modeling 
pack^e,  MMADS,  was  us^  to  construct  and  view  jdl  molecular  structures^’.  Multilinear  re¬ 
gression  analysis  was  used  to  obtain  the  coefficients  in  the  correlation  equation.  Terms  signifi¬ 
cant  at  the  0.95  level  were  retained  and  compounds  with  residuals  greater  than  three  standard 
deviations  w^  considered  outliers. 

The  properties  and  compounds  were  chosen  to  coincide  with  sets  of  data  that  describe  a 
variety  of  solvent/solute  interactions. 


Results: 

The  TLSER  equation  obtained  for  the  pKa  of  a  series  of  organic  compounds  (including 
mono-  and  dicarboxylic  acid,  alcohols,  thiols,  ketones,  phosphorous  and  nitrogen  containing 
species)  is 

pKa«  -86.7  +  228e^  -  62.3qH^ 

(-6.44)  (13.1)  (-8.19) 

n=43  R=0.971  s.d.=2.78  F=220 

The  quantities  in  parentheses  are  the  t-statistic  values  for  the  coefficients.  There  is  a  strong 
correlation  between  and  indicating  that  one  should  be  eliminated  from  the  equation.  The 
resulting  equation  is 


pKa=  127  -  48.7  q^,^ 

(11.7)  (-6.94) 

n=*43  R=0.940  s.d.=2.79  F=156 

These  equations  lead  to  the  following  observations.  (1)  Only  the  electronic  descriptors  are 
significant  (more  specifically  the  hydrogen  bonding  terms).  (2)  The  pKa  increases  with  decreas¬ 
ing  au;idity.  (3)  The  pKa  increases  with  increasing  basicity.  These  inferences  are  chemically 
reasonable. 

For  charcoal  adsorption  from  aqueous  solution  for  a  set  of  compounds  primarily  made 
up  of  alkyl  alcohols,  ketones,  and  aldehydes  the  relation  is 

log  A  =  2.46(V^/100)+5.78xj-4.66q,-6.68. 

n=33  R=0.955  s.d.=0.24  F=99.8 

This  equation  leads  to  the  following  observations.  (1)  The  molecular  volume, V  ,  ,  increases 
absorption  on  charcoal;  larger  molecules  would  tend  to  be  excluded  from  the  ^ueous  phase. 
(2)  The  polarizability  index,  Tj,  increases  absorption  on  charcoal;  greater  polarizability  would 
permit  greater  interaction  with  the  pi  electrons  on  the  charcoal.  (3)  The  electrostatic  basicity 
decreases  absorption  on  charcoal;  greater  charge  would  tend  to  increase  the  interaction  with  the 
(polar)  water  molecules. 
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For  a  set  of  all^l  and  compounds  the  equation  for  the  HPLC  retention  index  in  a 
50%  methanol-water  mixture  is 

log  kSO  =  l.Z3(yjm)  -3.05q.  -1.52q„^  -0.459, 

(11.8)  (-10.8)  (-2.96)  (-2.71) 

n=«21  R=0.980  s.d.  =0.124  F=141 

The  equation  leads  to  the  following  observations.  (1)  The  volume  term  increases  the  retention 
on  the  column;  the  larger  molecules  would  tend  to  excluded  from  the  polar  solvent.  (2)  The 
electrostatic  bonding  descriptors  decrease  the  retention;  there  is  greater  interaction  with  the 
(polar)  water  molecules.  We  might  expect  these  if  the  HPLC  column  is  not  polar.  The  relation 
for  a  75%  methanol-water  solution  h^  the  same  terms  and  signs  and  has  R=0.981. 

For  the  electronic  absorption  for  the  three  pyridinium  ylides  (in  a  set  of  alkyl  and  aryl 
solvents)  examined  here  the  best  fit  was  for  a  compound  with  two  ethoxy  groups.  The  relation 
for  the  electronic  absorption  (cm*‘)  of  this  compound  is 

EA=  -45734tj  +8733q„^  +27629. 

(-7.40)  (9.02)  (39.4) 

n=22  R=0.960  s.d.=279  F=112 

This  equation  leads  to  the  following  observations.  (1)  The  absorption  peak  position  depends  on 
electronic  descriptors;  this  is  reasonable  since  it  involves  transitions  in  electronic  states.  (2)  The 
polaiiaability  index  decreases  the  position  of  the  absorption  peak  (a  red  shift);  this  implies  an 
mteiaction  of  the  pi  electrons  on  theylide  ring  with  the  electrons  of  the  solvent.  (3)  The  electro¬ 
static  acidity  increases  the  position  or  the  absorption  pe^  (a  blue  shift);  it  suggests  an  interac¬ 
tion  of  an  acidic  site  on  the  solvent  molecule  with  a  basic  site  on  the  pyridinium  ylide  (maybe 
the  negatively  charged  carbon  next  to  the  nitrogen  on  the  ring).  The  relations  for  two  other 
ylides  have  the  same  terms  and  signs  as  in  the  above  equation;  their  R  values  are  0.904  and 
0.930. 

The  rate  constant  for  the  hydrolysis  of  a  set  of  organophosphonothioates  is  described  by 
this  equation. 


log  kOH  =  -13.1  Tj  -  6.1 1  6,  +  7.09. 

(-2.98)  (-40.2)  (13.6) 

n=35  R=0.990  s.d.=0.117  F=817 

This  relation  makes  the  following  points.  (l)The  rate  constant  depends  only  on  the  electronic 
descriptors;  the  bulk/steric  term  of  the  substituents  in  this  set  of  compounds  was  not  significant. 
It  could  imply  that  the  mechanism  involves  attack  near  the  P-S  bond  away  from  the  R  groups. 
(2)  The  rate  decreases  with  increasing  polarizability;  greater  ease  in  distortion  of  the  electron 
"cloud"  interferes  with  tlie  reaction  site.  (3)  The  rate  mcreases  with  increasing  hydrogen  bond¬ 
ing  acidity;  this  is  in  keeping  with  the  involvement  of  a  hydroxide  ion. 

A  selection  of  aliphatic  and  aromatic  compounds  gave  the  following  relationship  for  the 
octanol'water  partition  coefficient. 

log  Kow  =  3.00  (V_^/100)  -8.47  Tj  +17.3  -5.42  q  -3.96. 

(11.7)  (-0.37)  (2.76)  (-11.6) 

N=64  R=0.957  s.d.=0.36 
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C^servations  from  this  relation  follow.  (1)  The  bulk/steric  (V  )  term  is  significant  as  it  is  in  the 
previous  distribution  ^uilibiia:  charcoal  absorption  and  HPIX  retention  index.  The  larger  the 
volume  the  more  readily  the  molecule  tends  to  be  excluded  from  the  polar  solvent.  (2)  ^e 
hydrogen  bonding  basicity  enhances  die  solubility  in  the  octanol  over  that  in  water;  an  interpre- 
tation  Is  that  an  interaction  with  the  lowest  unoccupied  molecular  orbital  (LUMO)  in  the  octanol 
twids  to  be  favorwi  over  the  corresponding  one  in  water.  (3)  The  more  negative  the  formal 
charge  the  less  the  tendency  to  dissolve  in  the  octanol;  this  is  in  keeping  with  a  charge-dipole 
interaction  between  the  negative  charge  and  the  polar  water  molecule.  (4)  The  polarizability 
index  is  not  significant  but  was  included  because  the  corresponding  LSER  relation  contained  an 
analogous  term;  however  the  magnitudes  and  signs  were  very  similar. 


Conclusions: 

The  implication  of  the  TLSER  descriptors  to  the  six  physico-chemical  properties  in  this 
paper  as  well  as  Ae  W  toxicological  prop^es  studiai  earlier  point  out  these  features:  (1)  The 
descriptors  give  good  to  very  good  correlations  ;  (2)  They  give  physically  reasonable  interpreta¬ 
tions  ;  (3)  They  apply  to  a  wide  range  of  properties  ;  (4)  They  apply  to  a  wide  range  of  com¬ 
pounds. 

An  attractive  feature  is  that  they  are  obtained  from  computation  and  not  experiment. 
Consequently  they  permit  almost  a  priori  prediction  of  properties.  Furthermore,  the  physical 
interpretation  can  suggest  modes  of  interaction  between  solute  and  solvent  molecules;  in  that 
sense  they  provide  a  probe. 
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iPttOdUCtiflQ 


We  have  found  several  conelations  between  computed  properties  [the  electrostatic  potential 
V(r)  and  the  average  local  iotiization  energy  l^r)]  and  (a)  die  solvatochromic  hydtogen-bond*clonor 
and  -accepts  parameters  a  and  p,  (b)  the  Hammett  constants  CFp  and  and  (c)  the  inductive 

parameters  oi  [1-7].  These  relatitHiships  will  be  discussed  following  a  brief  synopsis  of  our 
cc»nputati<mal  methodology. 


Computed  Propcgjfis 


1.  Electrostatic  Potential  V(r): 

The  electrostatic  potential  V(r)  that  dui  nuclei  and  electicms  of  a  molecule  create  in  the 
surrounding  space  is  expressed  rigorously  by  eq.  (1): 


V(r)*5^ 

A 


J  li'-il 


(1) 


Za  is  the  charge  on  nucleus  A,  located  at  Ra;  p(r)  is  the  electronic  density  function  of  the 
molecule.  V(r)  expresses  the  net  electrical  effect  of  the  nuclei  and  electrons  of  a  molecule  at  any 
point  r  in  the  surrounding  space,  and  is  well  established  as  an  effective  tool  in  interpretuig  and 
predicting  the  reaedve  behavior  of  molecules  [3,8-11].  The  sign  of  V(r)  in  any  particular  region 
depends  upon  whether  the  effects  of  the  nuclei  or  the  electrcms  are  dominant  there.  Thus  an 
approaching  electrophile  will  initially  be  attracted  to  those  regitms  in  which  V(r)  is  negative,  and  in 
particular  to  the  points  where  VCr)  has  its  most  negative  values  (the  local  minima,  ot  Vmjn)* 

While  sites  susceptible  to  electrophilic  attack  can  be  identified  and  ranked  quite  readily  by 
means  of  the  local  V(r)  minima  [3,8-1 1],  the  situation  is  not  as  straightforward  for  nucleophilic 
interactions,  due  to  the  fact  that  V(r)  maxima  are  found  only  at  tlte  positions  of  the  nuclei  [12]. 
These  reflect  the  magnitudes  of  the  nuclear  charges  and  therefore  cannot  be  assumed  to  indicate 
relative  reactivities  toward  nucleophiles.  There  is  accordingly  no  criterion  fOT  nucleophilic 
interactions  that  corresponds  directly  to  Vmin  for  electrophilic  ones.  However  we  have  developed 
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techniques  tiiuit  do  allow  the  electrostatic  potential  to  be  used  to  intexpiet  nwlec^hilic  processes 
[5,13-18].  For  example,  invesdgation  of  V(r)  on  two-  or  thiee-dizncnsional  sur&ces  significantly 
itisoved  fiom  ti»  nuclei  has  le^^aled  buildups  of  positive  potential  that  do  tefiect  relative  afflnides 
for  nudet^hiles  [5,14-18].  One  particularly  promising  approach  involves  pltxting  V(r)  cm  a 
molecular  surfiice  defined  by  the  0.002  or  0.001  electroo^xrbt^  contour  of  die  electrcxtic  density 
functioa  p(r)  [19].  Hicse  txtntours  have  been  shown  to  contain  at  least  95%  of  the  total  electronic 
charge,  and  to  provide  physically  reasonable  molecular  dimenarms  [20,21]. 

2.  Average  Local  Ionization  Energy  i(r) : 

We  have  recently  introduced  a  property  which  we  interpret  as  the  average  local  icnizatiou 
energy,  I(r)  [2].  It  is  rigorously  defined  within  the  ftamework  of  sdf-ctmsistent-fidd  noolecular 
orbital  (SCF-MO)  theory  by  eq.  2. 


pi(r)  is  the  electnxiic  density  of  the  i^*  molecular  orbital  at  the  point  r,  ei  is  the  orbital  energy  and 
p(r)  is  die  total  electronic  densiQr  function  of  die  molecule^  We  have  shown  that!(r)  plotted  on 
wdl-defined  molecular  surfaces  is  a  quantitative  indicaKir  of  leacdvi^  toward  electrqihiles  in  a 
variety  of  chemical  systems  [2,7,22-26].  The  positions  where  tte  surface  l(r)  has  its  lowest 
values  ^jnin)  are  die  sites  with  the  least  tightly  bound  electrons,  and  thus  most  susceptible  toward 
electrophilic  attack. 


HYdrosgn-BoodrPonor  and  lAcccptor  Cgrelatjons 

Linear  solvation  energy  relationships  based  upon  the  use  of  "solvatochromic  parameters" 
have  resulted  ftom  extensive  efforts  to  quantify  solvent  effects  on  various  experimentally-observed 
quantities  (e.g.  rate  constants,  equilibrium  constants  and  IR,  NMR,  ESR  and  UV/vis  absorption 
maxima  and  intensities)  [27-31].  Two  of  the  solvatochromic  parameters,  designated  as  a  and 
have  been  interpteted  as  providing  measures  of  a  solvent's  ability  to  drxiate  or  accept  a  proton, 
respectively,  in  solute-solvent  hydrogen  bonding  [31]. 

We  have  recently  found  that  good  relationships  exist  between  P  and  the  most  iwgative 
electrostatic  potentials  associated  with  the  hydrogen-bond-accepting  heteroatcxns  in  four  families  of 
conqxxmds,  taken  separately:  ten  azines,  four  primary  amines,  four  alkyl  ethers  and  fifteen 
molecules  containing  double-bonded  oxygens  [4].  The  coirelatitHi  coefficients  were  found  to  be 
0.96, 0.98, 0.94  and  0.95,  respectively.  We  computed  the  Vnun  at  the  STO-5G  level,  using 
STO-3G  optimized  geometries. 
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In  view  of  the  good  coneladcms  bctwen  P  and  Vtnin.  it  was  suggested  that  there  might 
also  be  one  between  the  hydrogen-bond-donating  paranretcr  o  and  positive  regions  of  V(r)  [32]. 
Since  this  coonesponds  to  interaction  with  a  nuclecq)bile,  we  ctanputed  V(r}  on  the  surfaces  of  the 
molecules;  these  u«re  defined  by  the  0.001  electnm/bohi^  cemmur  of  the  molecular  electronic 
density.  We  did  indeed  find  good  relationships  between  a  atui  the  nmst  positive  surface 
electrostatic  potential,  Vs^nax.  for  two  groups  of  hydrogen  boi^  donors  taken  separately  [5,6];  the 
correlation  coefficients  for  both  are  0.97.  In  all  instances  the  Vsjmax  aie  associated  with  hydro¬ 
gens  that  couM  be  donated  in  a  hydrogen  bond.  The  electrostatic  potentials  were  computed  at  the 
STO~5G  level  (using  ST03G  optimized  geometries). 

These  relationships  between  a  aiui  Vs^nax  P  and  Vmin  confirm  that  the  electrostatic 
potential  in  the  space  around  a  ^  phase  molecule  is  a  key  (but  not  die  sole)  factm:  in  determining 
its  ability  to  donate  or  accept  a  proton  in  a  hydrogen  boml,  and  they  provide  a  predictive  capability 
fCH*  obtaining  unknown  a  and  P  values. 

Hammett  Constant  Correlation 

We  have  shown  f<»:  a  series  of  monosubstituted  benzenes  0  [2]  and  some  azines  and  azdne 
N-oxides  [e.g.  pyridine  (H)  and  pyridine  N-oxide  (HDl  [23,24]  that  is^  values  are  a  quantitative 
measure  of  the  activating/deactivating  and  directing  temiendes  of  the  various  substituents  in  regard 
to  clecm^hilic  aromatic  substitution.  The  ring  carbon  Is,min  of  systems 

X  =  NH2,  CH3,  H.  F, 

CH0,N02 


bearing  ortho-  and  para-  directing  substituents  (or  atoms)  (NH2,  CH3,  F,  N^'-O")  arc  located  above 
the  ortho  and  para  ring  positions,  while  those  for  species  with  meta-directing  substituents  or  atoms 
(NO2,  CHO,  N)  are  positioned  above  the  meta  [2,23,24].  For  ihc  former,  the  Isniin  ar®  smaller  in 
magnitude  than  for  benzene,  showing  that  the  ring  has  been  activated,  while  fw  the  latter  the 
reverse  is  observed.  Is,min  are  also  found  in  the  vicinities  of  heteroatoms  containing  lone  pairs, 
e.g.  O,  N,  F  and  S. 

The  relationship  between  the  ring  carbon  1$^  and  aromatic  reactivity  can  also  be  put  on  a 
more  quantitative  basis,  as  evidenced  by  the  excellent  correlation  found  between  the  ring  carbon 
ISjnin  of  the  monosubstituted  benzenes  and  the  corresponding  Hammett  constants  <5p  and/or  <5m 
[2].  (Linear  correlation  coefficients  of  0.99  are  obtained  with  i(r)  computed  at  both  the  6-3 IG* 
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and  ST05G  levels,  using  ST0-3G  geometric  [2,7].)  The  Hamnett  constants  arc  well- 
established  tneasuies  of  the  overall  electron-withdrawing  aiul  -donating  teialencKs  of  substituent 
groups  m  bcnaaie  rings  [33-35].  This  relationship  allows  us  to  naake  predictions  of  Hammett 
constants  har  substituents  fen*  which  these  are  not  known.  ¥ot  exacqple,  we  have  predicted  Op  for 
NHF  to  be  0.06,  and  Cp  and  for  NF2  to  be  0.49  and  0.54,  respectively  [2]. 

Cfflgtetjons  with  ot 

We  have  dmuBistrated  that  the  amine  nitrogen  Vmin  of  NH2-X  molecules  provide  a 
quandtadve  noeasare  of  the  total  electrcm-anracting  tendencies  of  toe  substituents  X  [1].  We 
oanputed  Vmin  at  the  STO-5G  level  using  ST03G  geoototries  fw  twenty-four  NH2-X  motecules 
and  found  correlations  between  Vmin  ^  ^  Vmm  and  01  oir  utoen  Oft  >  0  (electrcm- 

witodrawing);  the  condatiem  coefScients  were  0.90  and  0.92,  rrapectively  [1].  For  a  subgroi^  of 
thei%  molecules,  we  have  more  recently  observed  that  both  Vmin  aid  ts^nin  conelate  well  with  oi 
or  01  +  OR,  when  OR  >  0;  conelation  coefficients  range  from  0.97  to  0.99  [6]. 


Correlations  with  pKa 


Since  Isnun  is  evidently  a  good  index  of  reactivity  toward  electrt^toiles,  and  H'*'  is  certainly 
an  electrqphile,  we  have  investigaced  the  possibility  that  aqueous  aciditks  (pKa's)  may  coielate 
with  the  Is  jnin  of  the  ccxijugate  bases.  We  found,  for  a  wide  variety  of  carbtm,  oxygen  and 
nitrogen  acids,  that  this  is  indeed  the  case  [23,25,26].  In  tois  manner  we  are  able  to  predict  pKa 
values  even  for  very  strong  adds  for  which  aqueous  acidities  cannot  be  measured  because  of  the 
leveling  effect  of  water.  It  is  umxntant  to  mendmi  that  although  both  Vmio  and  1$^  can  be 
viewed  conceptually  as  indicative  of  reacdviQr  toward  electrqphiles,  Is^  appears  to  be  better 
suited  to  be  a  general  measure  of  aqueous  acidity  [6,26], 

As  an  extensiem  of  our  versus  pKa  studies,  we  have  investigated  toe  surface  I(r)  of 

the  conjugate  bases  of  a  series  of  substiuted  acetic  acids  [7].  The  pKa's  of  tois  series  of  adds 


X-H,C--C^'' 


'OH 


X  =  H,  CH3,  CH2OH.  NH2.  OH.  F,  a, 
CHO,  CF3,  CN,  NO2 


have  been  used  as  a  basis  for  defining  iiwiuctivc  substituent  constants,  <Ti  [34,35].  In  light  of  our 
rxent  findings,  it  seemed  reasonable  therefore  to  antidpate  that  Is,n»in  of  the  conjugate  bases  of 
substituted  acetic  acids  might  correlate  with  cTi  valtws  of  the  corresponding  substituent  groups. 
The  relationship  between  6-31G*/3-21G  Is^  and  oj  for  the  molecules  listed  above,  excluding 


80 


CH2F-CXX)H,  was  found  ic  be  quite  good;  the  linear  correlation  coefficient  is  0.97.  The  point  for 
fluoroaceiic  acid  is  outlying,  as  we  anticipated  from  earlier  studies  of  Vmin  and  is,min  of  NH2-X 
systems  [1,6]  and  other  experimental  and  theoretical  studies  [25,36-39]  supporting  the  view  that 
fluorine  has  a  limited  capacity  for  accepting  addidonal  electronic  charge,  despite  an  initial  strong 
attraction  for  it. 

Our  relationship  between  Isjnin  and  Oi  presents  a  promising  approach  for  theoretically 
determining  substituent  coritants.  In  principle,  the  same  procedure  could  be  carried  out  for  other 
reaction  series  used  as  means  for  determining  substituent  constants  (Oi,  Gm,  etc.),  e.g. 
bicyclooctane  carboxylic  acids  (Y).  protonated  quinuclidines  (YD  and  benzoic  acids  (YID  [35]. 


Y  YI  YE 


Fur:-*  Directions 

rtS  part  of  our  continuing  smdies  of  molecular  reactive  behavior,  we  plan  to: 

1)  uivestigate  the  relationship  between  i(r)  and  polarizability,  possibly  leading  to  the 
determination  of  local  polarizabilities; 

2)  examine  possible  correlations  between  V(r)  and  the  solute  parameters  and  ’'''as 

suggested  by  Dr.  R.  W.  Taft  and  Dr.  M.  H.  Abraham; 

3)  explore  the  use  of  the  para  -substituted  aniline  frameworic  in  measuring  substituent  effects 
as  suggested  by  Dr.  C.  Hanscb;  and 

4)  compute  surface  i(r)  for  the  conjugate  bases  of  a  variety  of  strong  acids  with  the  aim  of 
predicting  pKa's. 


We  greatly  appreciate  the  support  of  this  work  by  the  Office  of  Naval  Research  (contract 
#N0(X)14-85-K-O2l7)  and  by  the  Army  Research  Office  (contract  #DAAI..03-90-G-0205). 
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AB  INITIO  QUANTUM  CHEMICAL 
INVESTIGATIONS  OF 
SQLVATQCHRQMIC  AND 
SUBSTITUENT  PARAMETERS 

We  have  found  several  correlations  to  exist 
between  computed  properties  [the  electrostatic 
potential  V(r)  and  the  average  local  ionization 

energy  I(r)]  and  solvatochromic  and  substituent 
parameters.  These  relationships  are  listed  below. 

Nature  of  Correlation: 

1 .  Hydrogen-bond-acceptor  parameter  P  vs. 

V(r)  minima 

2 .  Hydrogen-bond-donor  parameter  a  vs. 

surface  V(r)  maxima 

3 .  Hammett  constants  Cm  and  Cp  vs. 

surface  I(r)  minima 

4 .  Inductive  substituent  constants  Oi  vs. 

(a)  V(r)  minima 

(b)  surface  I(r)  minima 
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ELECTROSTATIC 

POTENTIAL 


Za  is  the  charge  on  nucleus  A,  located  at  Ra,  and 
p(r)  is  the  electronic  density  function  of  the 
molecule. 

The  electrostatic  potential  is  a  real  physical 
property  which  expresses  the  net  electrical  effect 
of  the  nuclei  and  electrons  of  a  molecule,  and  has 
emerged  as  an  effective  tool  for  studying  the 
reactive  behavior  of  molecules,  including  both 
electrophilic  and  nucleophilic  processes,  and  also 
intermolecular  hydrogen  bonding  and  recognition 
interactions. 
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NEGATIVE  REGIONS  OF 
ELECTROSTATIC  POTENTIAL 


The  electrostatic  potential  around  a  ftee 
neutral  atom  is  positive  everywhere.  In  response 
to  electronic  rearrangements  that  occur  when 
atoms  interact  to  form  a  molecule,  regions  of 
negative  potential  normally  develop.  Most  of 
these  can  be  classified  into  three  general 
categories: 

1 )  lone  pair  regions  of  heteroatoms : 

e.  g.N,  0,  F,  Cl,  S,  Br 

2)  TC  regions  of  unsaturated  molecules: 

eg-  H2C=CH2 

3)  C-C  bond  regions  of  strained  hyrocarbons: 
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CALCULATED  ELECTROSTATIC 
POTENTIAL  IN  THE  MOLECULAR 
PLANE  OF  PYRAZINE 


Vmin  =  -82.4  kcal/mole  (★) 


RELATIONSHIPS  BETWEEN 
HYDROGEN-BOND-ACCEPTOR 
PARAMETER  P  AND 
ELECTROSTATIC  POTENTIAL 
_ MINIMA _ 

The  parameter  p  has  been  interpreted  as 
providing  a  measure  of  a  solvent's  ability  to 
accept  a  proton  in  a  solute-to-solvent  hydrogen 
bond.  We  have  found  correlations  between  p  and 
the  most  negative  electrostatic  potentials  (Vmin) 
associated  with  the  hydrogen-bond-accepting 
heteroatoms  in  four  families  of  molecules,  t^en 
separately: 

a)  azines 

b)  primary  amines 

c)  alkyl  ethers 

d)  molecules  containing  double- 
bonded  oxygens 


MOLECULES  CONTAINING  DOUBLE-BONDED  OXYGENS: 


Molecule 


P  .  y W  Molecule 

^  (kcal/mole) 


0=P(CH3)3 


1.02  -94.5 


0.85  -73.4 


0=S(CH3)2 


0 

II 

c 

(ch3)2N^  ^ai3 


0.80  -68.0 


0.76  -69.5 


0.76  -65,2 


O 

II 

c 

(CH3)jN  CF3 


11 

c 


HgC/  ^0CH3 


o 


CHsOLCjf 


o 

11 


OCH, 


B  Vmin 

(fccal^le) 


0.46  -48.2 


0.44  -53.5 


0.42  -46.6 


0.40  -54.0 


0.37  -52.5 


O 

II 

C 

(CH3)2N'^  H 


o 

II 

a 


“CH,  0.49  -56.4 


O 


0,69  -60.5  OHjCCCHjCl 


0 

II 

H3CCCCI3 


0.14*  -13.0 


O 

il 

H3CCCH3 


0.48  -52.9 


Dr.  R.  W.  Taft  has  pointed  out  that 
this  value  may  be  unreliable. 


Correlation  Coefficient  =  0,95 
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BETA 


_ p  versus  Vmin 

AZINES:  Correlation  Coefficient  =  0.96 


PRIMARY  AMINES:  Correlation  Coefficient  =  0.98 


CH3(CH2)3NH2 


CH2NH2 


F3CCH2NH2 


AKYI  ETHERS:  Correlation  Coefficient  =  0.94 


0[CH(CH3)2]2 

0(CH2CH2CH3)2 


0(CH2CH3)2 

H3COCH2CH20CH3 
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POSITIVE  REGIONS  OF 
ELECTROSTATIC  POTENTIAL 


Maxima  in  V(r)  occur  only  at  the 
positions  of  the  nuclei  of  molecules. 
These  reflect  the  magnitudes  of  the  nuc¬ 
lear  charges  and  do  not  indicate  relative 
affinities  for  nucleophiles.  Thus,  there  is 
no  criterion  for  predicting  nucleophilic 
attack  that  corresponds  directly  to  Vmin 
for  electrophilic  attack. 

It  is  nevertheless  possible  to  use  V(r) 
to  interpret  and  predict  nucleophilic 
processes.  One  useful  approach  involves 
examining  V(r)  on  a  three-dimensional 
surface  encompassing  the  molecule. 
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RELATIONSfflPS  BETWEEN 
HYDROGEN-BOND-DONOR 
PARAMETER  a  AND  SURFACE 
ELECTROSTATIC  POTENTIAL 

MAXIMA 


We  have  shown  for  two  groups  of 
hydrogen  bond  donors  that  surface  V(r) 
maxima  (Vs,max)  correlate  with  the  para¬ 
meter  a,  which  is  viewed  as  a  measure  of 
a  solvent's  ability  to  donate  a  proton  in  a 
solvent-to-solute  hydrogen  bond. 


y(r)  Ranges  (kcal/moie): 

BLACK:  Greater  than  27. 
DK.GRAY:  Between  10 
and  27. 

LT.GRAY:  Between  0 
and  10. 
WHITE:  Negative. 


Vs^ax  ~  +28.6 

[0.001  contour  of  electron  density; 
hydroxyl  group  faces  viewer.] 
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Surface  V(r)  of  Ethanol 


CORRELAHONS  BETWEEN  a  AND  Vs^ax 


1.  Molecules  containing 
OH.gjCMSI 
F3CCH(0H)CF3  , 
F3CXH2OH 
H2O 

CH3COOH 


CH3OH 

HOCH2CH2OH 

CH3CH2OH 

CH3CH2CH2CH2ai 

CH3CH2CH2OH 


CH3CH(0H)CH3 


(CH3)3C0H 


2.  Molecules  without 
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Vs, max  (kcai/mole) 


AMINE  NITROGEN  Vmin  OF  NH2-X 
MOLECULES  AS  MEASURES  OF  THE 
TOTAL  ELECTRON-ATTRACTING 
TENDENCIES  OF  SUBSTITUENTS 

We  have  found  correlations  between  the 

amine  nitrogen  Vmin  of  NH2-X  molecules  and  ai 

or  ai  +  Or  when  or  >  0  (electron-withdrawing), 
making  it  possible  to  estimate  either  of  these 
quantities. 


Examples: 


NH2-X 

Vmin 

(kcal/mole) 

cn 

X  =  H 

-109 

0.00 

X=NCH3F 

-91 

0.28* 

X=NHF 

-83 

0.43* 

X=NF2 

-77 

0.53* 

X=CN 

-68 

0.57 

*Estimated  from  our  \ 

min  VS.  (ai  +  gr)  when  or  >  0  correlation. 
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AVERAGE  LOCAL 
IONIZATION  ENERGY 


m 


P;(r) 


e. 


P(r) 


pi(i^)  is  the  electronic 
density  of  the  ith  molecular 

orbital  at  the  point  Y,  £i  is 
the  orbital  energy  of  the  ith 

molecular  orbital  and  p(Y) 

is  the  total  electronic 
density. 
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BLACK:  less  than  11.8  eV. 

GRAY:  between  11.8  and  12.1  eV. 
WHITE:  greater  than  12.1  eV. 
MINIMA;  11.5,  11.7  eV. 


MINIMA:  12.1  eV  (ABOVE  CARBONS) 


BLACK;  less  than  13.2  eV. 
WHITE:  greater  than  13.2  eV. 
MINIMA:  13.0  eV. 


Imin 


HAMMETT  CONSTANT 


CORRELATION 
COEFFICIENT  =  0.99 


X 


X  =  NH2,  CH3  H, 
F,  CHO,  NO2 


F,N 


LT.GRAY:  Less  than 
12.8  eV. 

DK.GRAY:  Greater  than 
12.8  eV. 

MINIMA:  12.7, 12.8  eV. 


PREDICTIONS 


Hammett  Constants 


Taft  constants,  predicted 
from  earlier  work  and 


Gp 

NF2  0.49 

NHF  0.06 


0.54 


Gp  «  oi  +  OR 
Gl  GR 

0.53  -0.04 

0.43  -0.37 
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CORRELATIONS  BETWEEN  pKa  AND 
CONJUGATE  BASE  SURFACE  i(r)  MINIMA 


Our  finding  that  l(r)  on  molecular  surfaces  provides 
quantitative  predictions  of  reactivity  toward  electrophiles  led 
us  to  investigate  whether  i(r)  might  be  related  to  acidity, 
since  the  latter  reflects  the  tendency  to  interact  with  the 
electrophile  H**" . 

1 .  Azines  and  Azoles: 


CORRELATIONS  BETWEEN  pKa  AND 
CONJUGATE  BASE  SURFACE  i(r)  MINIMA 


2.  Carbon.  Nitrogen  and  Oxygen  Acids: 


CH3CN 

ClOH 

CHqPCOOH 

CH4 

NH2CONH2 

NH2NO2 

naOl 

A 

HOOH 

CH3COOH 

CHCI2C00H 

NHs 

CH2(CN)2 

HCOOH 

HOOCCOOH 

<Q> 

CH3NO3 

CH2(N02)2 

CH(N03)3 

H2CCH2 

HCN 

HNO2 

CH(CN)3 

HCCH 

CH2CINO2 

CH2aCOOH 

CH(N02)2CN 

RELATIONSHIP  BETWEEN  ai  AND 
SURFACE  !(r)  MPmiA 


The  pKa  s  of  substituted  acetic  acids  have  provided  a 
basis  for  defining  the  inductive  substituent  parameters  oi.  As 
an  extension  of  our  Is^nin  vs.  pKa  studies,  we  have 
investigated  the  surface  I(r)  of  the  conjugate  bases  of  a  series 
of  substituted  acetic  acids  in  relation  to  the  Gi  values  of  the 
substituents. 

Series: 

X-CHj-CXKD'  where  X  =  H,  CH3,  CH2OH,  NH2, 

OH,  CHO,  CF3,  Cl, 
CN,N02 
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A  Qsmprehensive  Approach  to  Structure-Activity  Relationship  * 


Corwin  Haosch.  David  Hoekman  and  Peng  Li 
Department  of  Chemistry,  Pomona  College.  Claremont.  CA  91711 


One  of  the  largest  of  the  Big-Science  projects,  although  it  is  not 
usually  thought  of  in  this  sense,  is  the  study  of  how  organic  chemicals 
interact  with  and  affect  living  organisms  (or  parts  thereof  such  as 
enzymes,  organelles,  etc.).  Many  billions  of  dollars  are  spent  each  year  by 
the  drug  companies  of  the  world  in  this  effort  to  say  nothing  of  all  of  the 
work  in  pesticide  science.  The  environmental  toxicology  of  both  industrial 
and  natural  products  is  also  of  increasingly  intense  interest.  Thousands  of 
academic  scientists  are  studying  the  basic  biochemistry,  molecular  biology 
and  physiology  of  xenobiotics  and  we  have  no  idea  of  the  effort  being 
expended  in  the  area  of  chemical  warfare.  What  is  needed  in  all  of  these 
areas  is  better  means  for  predicting  the  biological  activity  of  untested 
chemicals  from  a  knowledge  of  their  chemical  structure.  The  dimensions 
of  this  problem  are  enormous,  in  the  first  place  because  there  are  almost 
an  infinite  number  of  possible  organic  compounds.  Secondly  there  are  an 
unknown  number  of  species  of  life;  animals,  plants,  birds,  insects,  bacteria, 
viruses,  etc.  containing  a  huge  number  of  biochemical  systems.  As  yet 
there  has  been  no  systematic  approach  to  this  subject.  Scientists  from 
many  fields,  quantum  chemistry,  physical  organic  chemistry,  biochemistry, 
medicinal  chemistry,  pharmacology,  computer  technology  and  applied 
mathematics  are.  in  their  own  ways,  studying  what  might  be  called 
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selective  or  comparative  toxicology.  One  of  the  best  ways  to  organize  this 
fast  growing  multidisciplinary  effort  is  around  the  subject  of  quantitative 
structure-activity  relationships  (QSAR). 

The  OSAR  paradigm  was  first  outlined  in  1964,  and  although  great 
progress  has  been  since  then,  there  is  enormous  confusion  at  present 
because  of  the  multitude  of  approaches  to  the  subject  and  the  lack  of 
organized  data.  QSAR  are  now  being  published  in  an  astonishing  variety  of 
journals,  monographs  and  special  publications.  Only  a  handful  of  the 
world  s  libraries  would  contain  all  of  these  sources.  Besides  collecting  such 
data  in  computerized  form  there  is  an  urgent  need  for  some  kind  of 
common  language  to  discuss  QSAR.  We  believe  that  it  is  now  time  to 
attempt  this  even  though  we  by  no  means  have  the  knowledge  for  an 
enduring  system.  Even  a  tentative  system  and  language  can  do  much  to 
aid  discussion. 

The  Med-Chem  project  at  Pomona  College  has  been  working  at 
building  a  database  of  published  and  unpublished  QSAR  for  many  years.  It 
is  only  recently  that  the  software  has  evolved  to  something  approaching  a 
finished  product  which  is  user  friendly.  The  task  now  is  to  greatly 
increase  the  bank  of  QSAR  and  clean  up  and  standardize  many  of  the  early 
data  entries. 

The  database  has  been  organized  into  two  major  forms:  1)  physical 
organic  chemistry  and  2)  biomedicinal  OSAR.  Each  of  these  divisions  is 
split  into  two  parts:  1)  the  raw  data  (biological  activity,  chemical 
structures  in  terms  of  SMILES  and  parameters  used  to  derive  the  QSAR) 
and  2)  the  QSAR  equations  and  their  statistics. 

These  can  be  searched  concurrently  or  independently  in  a  multitude 
of  ways..  The  data  bank  is  also  lied  to  a  model-building  regression 
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program  into  which  any  particular  data  set  can  be  loaded  and  studied  with 
new  or  additional  parameters  and  if  an  improved  QSAR  is  found  it  can  be 
returned  to  the  bank  to  displace  the  old  one. 

There  are  three  operational  modes  in  which  to  study  the  database: 
browse,  search  and  show.  Using  the  Browse  Menu  (Table  II)  one  simply 
enters  a  menu  number  to  get  c.  listing  of  all  the  items  of  this  class  now  in 
the  database.  Entering  1  lists  all  of  the  various  types  of  commoii  names 
which  have  been  used  to  characterize  system  (eg.  cat,  E.  coli,  urease,  red 
cell,  protoplast,  worm,  etc.).  One  uses  the  seau'ch  mode  to  find  the  items  of 
interest  and  then  goes  to  the  show  mode  to  display  the  hits  found  in  the 
search. 

Organization  of  QSAR  database 


In  the  bio  database  each  set  is  assigned  nine  key  title  elements 


1.  System 

2.  Class 

3.  Type  of  compound 

4.  Action 

5.  Reference 

System  is  assigned  a  common  name  such  as  frog  nerve,  mouse,  E.  coli 
chloroplast,  etc.  to  characterize  how  the  chemicals  were  tested.  Similarly  a 
common  name  is  supplied  to  compounds  studied;  benzodiazepines,  benzoic 
acids,  miscellaneous,  etc.  Under  action  the  type  of  end  point  observed  is 


6.  Source 

7.  Check 

8.  Notes 

9.  Set  number 

10.  Date 


107 


noted:  LD50,  MIC,  increase  in  blood  pressure,  l/Km  or  kcat.  etc.  Item  5  is 
simply  reference  to  the  article  from  which  the  data  were  taken. 

The  OSAR  are  assigned  to  a  particular  class  (Table  III  and  IV)  so  that 
all  of  the  data  for  a  particular  class  can  be  studied  as  a  unit.  For  example 
one  might  wish  to  study  all  of  the  OSAR  on  hydrolases  which  could  be 
retrieved  by  means  of  the  label  B2C.  There  are  so  many  different  common 
names  that  it  becomes  important  to  have  the  systems  grouped  by  class. 
For  instances  under  animals  (B6A)  multiple  names  such  as  bovine,  cow, 
steer  or  mouse,  mice,  murine  might  be  used.  To  avoid  missing  data  sets 
one  can  retrieve  everything  under  B6A  and  scan  the  results  using  search. 

Under  source  the  name  of  the  person  entering  the  data  is  listed  and 
under  notes  the  definition  of  unusual  variables  or  procedures  can  be  given. 
Each  data  set  is  assigned  a  set  number. 

It  is  planned  to  have  each  data  set  checked  by  a  second  person. 
Searching  under  check  we  now  find  that  615  data  sets  have  been 
independently  checked  for  errors.  However,  this  does  not  mean  that  the 
current  equation  is  up  to  the  present  methods  or  is  based  on  this  latest 
parameter  values.  Most  of  the  QSAR  were  entered  before  the  advent  pf 
the  bilinear  model  and  many  can  now  be  improved  by  its  use. 

For  each  compound  a  SMILES^-^  notation  is  entered  so  that  one  can 
easily  search  the  database  for  any  structure. 

The  physical  database  is  organized  somewhat  differently  as  shown  in 
Table  IV.  There  is  nothing  comparable  to  the  biological  system  so  that 
here  system  refers  to  the  solvent  used,  or  the  gas  phase,  for  example.  The 
individual  classifications  need  pruning  and  reorganization,  but  as  they 
stand  they  do  provide  a  useful  means  for  studying  different  areas  of 
physical  organic  chemistry.  Thus  one  can  compare  oxidation-reduction 
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OSAR  from  simple  organic  reactions  in  homogeneous  solution  with 
biological  oxidation  reactions  by,  say,  oxidoreductase  or  microsomes. 

Common  Nomenclature 

The  need  for  a  common  language  to  discuss  structure-activity 
relationships  is  obvious  and  urgently  needed.  Many  different  parameters 
and  mathematical  models  are  now  being  published  so  that  the  field  is  in  a 
state  of  bewilderment.  While  any  parameters  or  regression  models  can  be 
used  in  our  current  system,  it  is  largely  based  on  three  major 
physicochemical  properties  of  substituents  or  molecules. 

1.  Electronic  S*.  S,  S*.  S;.  F.  R,  SI,  SR,  S* 

2.  Hydrophobic  log  P,  Pi 

3.  Steric  ES,  ESC.  MR.  MV 

in  the  above  S  -  a.  F,  SI  and  S*  are  fieid/inductive  parameters  while  R  and 
SR  are  resonance  constants  for  substituents.  A  few  QSAR  based  on 
molecular  orbital  indices  are  beginning  to  be  entered  so  that  terms  such  as 
EHOMO,  ELUMO,  qz,  etc.  are  entering  the  database. 

Unless  otherwise  noted  P  and  it  (Pi)  are  from  the  octanol-water 
system  and  refer  to  the  unionized  forms  of  the  chemicals.  If  not,  P'  and  Pi' 
are  used  to  note  the  difference  and  these  must  be  defined  in  the  notes. 

Q 

The  Taft  steric  parameter  (Es)  and  Hancock's  variation  of  it  (E  )  refer 
to  substituents  while  molar  refractivity  (MR)  or  molar  volume  (MV)  may 
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also  refer  to  the  size  of  the  whole  molecular.  The  sterimol  parameter  are 
labeled  L,  Bl,  B2,  etc.  refer  only  to  substituents. 

Of  course,  any  parameters  can  be  used  such  as  the  number  of  a-H 
(AH),  hydrogen  bonding  parameters,  etc. 

In  entering  the  dependent  variable  for  biological  activity,  where  ever 
possible,  log  1/C  is  used.  C  is  the  molar  concentration  of  chemical 
producing  a  standard  biological  response  in  a  fixed  time  interval.  In  some 
instances  responses  are  given  relative  to  a  standard  substance  so  that  the 
term  RBR  for  relative  biological  response  is  used.  Rate  constants 
(especially  for  enzymic  QSAR)  are  sometimes  available  and  these  must  be 
defined. 

There  are  two  modes  for  studying  the  data:  search  and  display. 
Table  V  shows  19  ways  in  which  the  search  can  be  structured.  It  also 
notes  that  in  the  Bio  database  as  of  July  1,  1991  there  were  2447  data  sets 
for  which  2869  equations  had  been  derived  and  stored.  Sometimes  more 
than  one  equation  is  stored  for  a  particular  data  set  and.  in  fact,  any 
number  can  be  stored.  At  present  there  are  3116  data  sets  in  the  physical 
database.  We  are  now  cooperating  with  several  laboratories  to  greatly 
expand  the  number  of  QSAR  especially  for  biological  data. 

Search  Procedure  For  Bio  Database 

With  the  search  menu  (Table  V)  one  can  formulate  various  kinds  of 
searches.  Entering  1  cat  will  find  all  of  the  QSAR  on  cats.  Entering  2  B4B 
will  /ield  all  equations  on  bacteria.  Entering  3  benzoic  acid  will  find 
data  sets  on  benzoic  acid.  Entering  4  LD^o  yields  sets  with  this  type  of 
action.  Searching  on  reference  (5)  one  can  look  for  all  references  by  the 
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name  of  any  author  (eg.  5  Jones)  or  one  can  search  by  year  or  by  specific 
article,  say,  5  J-  Med,  Chem.  Soc.  76,  4525  (1954).  The  numbers  1-5 
refer  to  Table  V. 

More  complex  studies  can  be  organized.  A.  Entering  12  clocccclO 
(SMILES  notation)  will  uncover  all  QSAR  data  sets  containing  phenol  as  one 
of  the  compounds  studied.  At  present  there  are  185  such  data  sets.  By 
entering  15  "  S  "  one  searches  this  set  of  185  for  all  QSAR  on  phenols 
which  have  a  term  in  the  Hammett  parameter  a.  This  turns  out  to  be  only 
15. 

B.  Searching  the  whole  database  with  15  "  S  "  yields  298  hits, 
searching  on  15  '  S*  "  finds  44  QSAR  having  a  term  in  a*  and  using  "  S"  " 
yields  69  examples  with  terms  in  a  The  hit  list  of  298  could  be 
approached  in  another  way  by  entering  16  2  <  "  S  "  <  3.4  which  finds  all 
equations  Laving  a  coefficient  (p)  with  o  between  2  and  3.4.  This  produces 
24  hits. 

C.  A  search  can  be  made  for  equations  of  a  certain  quality  by 
entering,  for  instance.  18  i  >  .95  which  finds  all  equations  with  a 
correlation  coefficient  greater  than  0.95.  At  present  this  yields  1238  hits. 
However,  this  does  not  mean  all  of  these  equations  are.  indeed,  of  high 
quality  since  there  are  many  small  sets  which  have  a  low  ratio  of  data 
points/variable.  To  consider  larger  sets  we  can  search  under  the  command 
18  n  >  50  which  shows  that  there  are  now  171  data  sets  having  50  or 
more  data  points  each. 

D.  To  study  steric  effects  in  the  reactions  of  hydrolases  we  can  enter 
the  commands:  2  B2C  and  15  "  Es  '  which  uncovers  37  such  QSAR. 

E.  To  ascertain  the  importance  of  the  hydrophobic  character  of 
insecticides  we  first  isolate  all  insect  QSAR  by  searching  with  B6B  and  find 
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80  such  equations.  The  "  or  "  search  15  "  Pi " "  P  “  uncovers  the  fact  that 
all  equations  contain  a  term  in  log  P  or  it.  As  of  the  present,  the  entry  of 
hydrophobic  terms  has  not  been  as  tightly  standardized  as  we  plan  to 
make  it.  Using  the  browse  mode  and  entering  1 0  we  can  inspect  all  of  the 
various  labels  used  for  the  hydrophobic  properties  of  chemicals  and  with 
these  we  can  discover  just  how  important  the  hydrophobic  terms  are  for 
the  whole  database.  Entering  15  "  Pi"  "  P"  "  P'  "  "  log  P  "  we  obtain 
2447  hits  on  the  2854  equations  or  86%  of  the  QSAR  contain  a  hydrophobic 
term.  This  estimate  is  actually  a  bit  low  since  there  are  a  number  of  other 
ways  in  which  hydrophobicity  has  been  termed.  We  estimate  that  at  least 
90%  of  the  OSAR  contain  a  hydrophobic  term.  A  similar  search  for  QSAR 
with  electronic  terms  uncovers  only  651  examples. 

Another  illustration  of  a  complex  search  is  to  first  isolate  all  QSAR  on 
animals  (2  B6)  to  obtain  286  sets.  Then  using  the  "  and  '  search  15  "  P  ", 
"  S  ", "  Es  "  we  find  a  total  of  eleven  equations  for  animals  containing  ail 
three  terms. 

Displaying  the  information  obtained  in  a  search  is  done  by  entering: 
show.  The  show  menu  (Table  VI)  is  essentially  the  same  as  that  for 
search  (Table  V). 

To  obtain  all  information  on  each  data  set  the  entry  /and  1.20  is 
made.  To  display  only  the  references  for  each  QSAR  one  enters  /stud  5. 

There  is  also  an  important  sort  option.  For  instance,  under  B  above 
we  found  69  equations  containing  a  term  in  a".  These  equations  could  be 
sorted  and  printed  according  to  the  p  associated  with  each  a  '  term:  /and 
/sort-16  1.5  15.18.  The  program  then  asks  for  the  item  to  sort  on  and 
we  enter  "  S"  ".  It  is  then  found  that  the  lowest  value  of  p  is  -2.42  with  all 
QSAR  ordered  according  to  increasing  size  of  p. 
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Any  data  set  uncovered  by  the  above  procedures  can  by  means  of  its 
set  ^  be  loaded  into  the  regression  program  for  further  study. 

Motivation  For  Building  a  QSAR  Data  Bank 

There  are  obvious  reasons  of  convenience  for  constructing  the  above 
described  system.  But  the  most  important  reason,  which  we  have  not 
mentioned,  is  that  of  establishing  the  validity  of  a  QSAR.  This  can  be 
illustrated  by  the  following  example. 

fH3 

Mutagenicity  of  X-C6H4-CH2N-N-O  in  the  Ames  test 

log  1/C  -  3.55  o- 3.88  a2^  1.62  \p- 5.11  (1 

n  -  13.  r  -  0.873 

log  1/C  «  0.92  2.08  a  -  3.26  (2 

n  -  12,  r  -  0  891 

In  these  QSAR  C  is  the  concentration  of  nitrosoamine  producing  a 

3  V 

standard  set  of  mutations  and  Xp  is  a  complex  connectivity  parameter 

developed  by  Kier  and  Hall^  and  used  in  this  instance  by  Singer^  to  form 
QSAR  1.  We  have  developed  QSAR  2  from  the  same  data.  Which  equation 
provides  the  most  useful  information  and  in  which  should  we  place  our 
confidence?  Statistics  are  of  no  help  since  the  values  of  r  are  close  to  each 
other.  One  data  point  was  dropped  to  obtain  eq.  2,  but  eq  1  uses  an 
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additional  parameter.  We  believe  that  eq.  2  is  the  one  to  select  because  it 
contains  terms  which  tie  it  into  the  basic  approach  from  physical  organic 
chemistry  and  in  particular  because  it  contains  a  hydrophobic  term.  As 
discussed  below  this  is  a  hallmark  of  QSAR  for  the  Ames  test. 

"Proof”  of  a  concept  in  science  generally  arises  as  the  number  of  facts 
which  can  be  rationalized  by  it  in  a  self  consistent  way  increases.  There  is 
no  fundamental  set  of  laws  or  even  a  single  such  law  which  can  be  used  as 
a  starting  point  to  relate  chemical  structure  with  the  biological  activity  of  a 
set  of  complex  compounds  acting  in  a  mouse  or  even  with  an  enzyme. 
Except  for  very  small  molecules,  quantum  chemistry  cannot  precisely 
define  the  electron  distribution  in  a  compound  or  how  it  might  interact 
with  a  reagent  or  a  receptor.  As  shown  above  the  operationally  defined 
hydrophobic  parameters  are  of  great  importance,  but  we  do  not  know 
what  all  is  contained  in  log  P  or  n. 

Steric  effects  in  QSAR  are  the  most  difficult  of  all  to  deal  with. 
Statistical  tests  such  as  correlation  coefficients  or  F  tests  are  no  firm  guide 
to  the  value  of  parameters  or  mathematical  models  in  a  general  sense.  We 
believe  that  only  via  lateral  correlation  of  a  given  QSAR  with  many  others 
can  one  develop  confidence  in  the  validity  of  any  approach  to  the 
correlation  of  structure  with  biological  activity.  In  so  far  as  possible  the 
biological  QSAR  must  be  related  to  the  far  better  understood  reactions  of 
simple  organic  compounds  in  homogeneous  solution  or  to  enzymic  or 
receptor  QSAR.  To  do  this  requires  a  language  based  on  the  concepts  and 
nomenclature  of  physical  organic  chemistry  and  a  very  large  database  of 
QSAR  with  which  comparisons  can  be  made.  The  following  are  a  very  few 
examples  which  illustrate  the  utility  of  the  present  QSAR  database  and  the 
current  nomenclature.  Obviously,  this  will  evolve  with  time. 
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Lateral  Correlation  in  Terms  of  Hydrophobic  Parameters 

There  are  simple  model  systems  which  can  be  used  to  compare  with 
the  effect  of  hydrophobic  compounds  on  the  more  complex  biological 
systems.5 

ROH  penetration  into  phosphatidyl  choline  monolayers 

log  1/C  -  0.87  log  P 0.62  (3 

n  -  4.  r  -  0.994.  s  -  0.082 

ROH  disagregation  of  silanized  glass  beads 

log  I /C  -  0.92  log  P  -  0.71  (4 

n  -  4,  r  -  0.999,  s  -  0.035 

Change  in  electrical  resistance  of  black  lipid  membrane  caused  by  ROH 

log  1/C  -  1.08  log  P  -  0.40  (5 

n  -  7.  r  -  0.983,  s  -  0.280 

ROH  inhibition  of  the  aggregation  of  bovine  blood  platelets 

log  1/C  -  1.00  log  P  +  0.18  (6 

n  »  4.  r  -  0.997,  s  »  0.064 
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The  above  OSAR  can  be  compared  to  examples  found  in  the  present 
data  bank  by  looking  at  QSAR  on  nerves. 

ROH  causing  10  mV  decrease  in  lobster  axon 

log  1/C  -  0.87  log  P  -  0.24  (7 

n  »  5.  r  =-  0.993,  s  -  0.100 

Salicylate  analogs  causing  a  20  mV  increase  in  mollusc  buccal  ganglion 

log  1/C  -  0.84  log  P  + 3-31  (8 

n  -  30.  r  «  0.979.  s  -  0.177 

Inhibition  of  contraction  of  frog  sartorius  muscle  by  miscellaneous 
compounds 

log  1/C  -  0.88  log  P  +  0.63  (9 

n  -  25.  r  -  0.955.  s  -  0.297 


The  dependence  of  nerve  (membrane?)  perturbation  on  log  P 
parallels  that  seen  in  the  artificial  model  systems.  While  there  are 
relatively  few  data  points  supporting  each  model  QSAR  so  that  no  single 
equation  would  give  one  the  confidence  to  place  much  weight  on  the 
comparisons,  when  all  are  taken  together  the  commonality  is  striking  and 
deserves  further  study. 

Studies  on  mutagenesis  by  various  organic  compounds  in  the  Ames 
test  yields  the  following  QSAR. 
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Mutagenicity  of  X-C6H4N"«NN{CH3)  in  TA92 


log  1/C  -  1.04  log  P  -  1.63  0-"  0.06  (10 

n  -  17.  r  -  0.974,  s  -  0.315 

Mutagenicity  of  X-quinolines  in  TAIOO 

log  TAIOO  -  1.08  logP  -  56.3  Qa  -  5.91  (11 

n  -  21.  r  -  0.827,  s  -  0.645 

Mutagenicity  of  arylamines  in  TA98 

log  TA98  -  1.08  log  P  1.28  eiUMO  +  1-46  II  7.20  (12 

n  -  88.  r  -  0.898,  s  -  0.860 

Mutagenicity  of  arylamines  in  TAIOO 

log  TAIOO  =»  1.13  log  P  +  1.66  enOMO  -  0.63  elUMO  +1.09  (13 

n  -  67.  r  -  0.881,  s  -  0.714 

Mutagenicity  of  aryl  nitro  compounds  in  TAIOO 

log  TAIOO  -  1.10  log  P  -  3.37  log  (p  •  10  ^08  P  +  1)  +  1.84  eluMO 

-3.35  la +5.64  (14 

n  -  1 17,  r  -  0.858,  s  -  0.920 
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Mutagenicity  of  aryi  nitro  compounds  in  TA98 


log  rA98  -  0.65  log  P  -  2.90  log  (0  •  10  iog  P  +  1)  -  1.38  bluMO 
+  1.88  II  -  2.89  la  -  4.15  (15 

n  “  188,  r  «  0.900,  s  -  0.886 

In  eq.  10  C  is  the  molar  concentration  producing  30  mutations  above 
background  in  10*  bacteria.  In  the  other  equations  log  TA  refers  to 
reveptants/nmol.  In  eq.  11,  qz  refers  to  the  change  on  position  2  of  the 
quinoline  ring  calculated  using  the  AMI  procedure.  The  variable  II  in  eq. 
12  and  15  is  assigned  the  value  of  1  for  all  compounds  containing  more 
than  2  fused  rings  (eg.  anthracene,  carbazol,  acridines,  etc.)  and  0  for  those 
containing  less  (eg.  benzene,  naphthalene,  quinoline,  etc.).  It  is  of  interest 
that  this  term  occurs  with  the  TA98  organisms  in  eq.  12  and  13  but  not 
with  the  TAIOO  organisms.  Thus  one  has  some  evidence  for  its  validity 
beyond  statistics.  The  indicator  variable  la  refers  to  5  complex 
hydrocarbons  which  have  much  lower  than  expected  activity.  Except  for 
eq.  10,  the  other  QSAR  for  mutagenesis  are  not  sharp  correlations, 
however,  it  is  found  that  the  log  P  terms  in  ail  of  the  equations  are  the 
most  important  variables  and  moreover  except  in  one  example  the  rising 
slope  has  a  coefficient  near  1.  Thus  eqs.  10-15  with  lateral  correlation  to 
eq.  2  help  one  to  choose  between  eq.  1  and  2.  Not  only  does  eq.  2  have  a 
term  in  log  P  but  its  coefficient  is  near  that  seen  for  the  other  QSAR.  It  is 
only  by  building  up  matrices  of  such  QSAR  that  validity  for  any  particular 
approach  can  be  established. 
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Another  set  of  QSAR  which  again  shows  the  significance  of  the 
octanol-water  log  P  parameter  comes  from  the  studies  of  the  induction  of 
P450. 

50%  Induction  of  chick  embryo  P450  by  ROH 

log  1/C  -  0.78  log  P  M.46  (16 

n  -  7.  r  -  0.988.  s  -  0.095 

50%  Induction  of  chick  embryo  P450  by  pyrazoles 

log  1/C  -  0.85  log  P+  1.93  (17 

n  -  8.  r  -  0.970.  s  -  0.305 

50%  Induction  of  chick  embryo  P450  by  barbiturates 

log  1/C  -  1.02  log  P-H  2.75  (18 

n  •  9,  r  ■  0,984,  s  «  0.186 
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Induction  of  rat  P450  by  benzofurans  and  dioxins 


log  1/C  -  1.14  logP  +  0.64  B5  -  1.24  Eg  +  0.58  (19 

n  -  43,  r  -  0.908,  s  -  0.670 


Equations  16-19  show  the  great  importance  of  hydrophobicity  in 
the  induction  of  P450;  this  coupled  with  the  mutagenicity  of  lipophilic 
compounds  increases  concern  about  lipophilic  compounds  in  the 
environment.  In  the  case  of  the  arylamines,  triazenes  and  quinolines  P450 
is  necessary  for  activity.  This  is  also  true  of  aromatic  hydrocarbons. 
Carcinogenicity  has  also  been  shown  to  be  highly  dependent  on  log  P  as 
seen  in  eq.  20.  The  lingua  franca  of  the  octanol-waler  log  P  enables  us  to 
see  far  reaching  consequences  of  lipophilic  compounds  in  environmental 
toxicology. 

log  A  -  0.55  log  P  -  1.15  log  (6  •  10  P  M)  +  0.38  LK  >  0.47  ehOMO 
♦  1.98  (20 

n  -  161,  r  -  0.844.  s  -  0.347 
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In  this  expression,  A  is  skin  carcinogenic  potency  as  defined  by  the  Ibali 
index  and  LK  is  an  indicator  variable  which  takes  the  value  of  1  for 
instances  where  a  substituent  is  placed  on  the  L  or  K  region  of  the  aromatic 
hydrocarbon  or  heterocycle.  These  compounds  are  about  twice  as  potent 
as  expected,  presumably  because  substitution  at  these  points  prevents 
nonproductive  side  reactions  (oxidation). 

Analysis  of  a  set  of  literature  data  aroused  our  interest  in  the  toxic 
action  of  benzimidazoles. 

I75  Influenza  B  virus  by  benzimidazoles 

log  1/C  -  0.57  log  P  M.58  (21 

n  -  13i  r  *  0.931,  s  -  0.166 

The  question  that  arises  from  the  QSAR  is.  is  this  a  specific  toxic  effect  or 
nonspecific  activity  as  the  low  value  of  the  coefficient  suggests?  Entering 
the  search  16  .5  <  "  P  ”  <  .7,  1.3  <  const  <  1.7  yields  QSAR  for 
comparison  having  similar  slopes  and  intercepts. 

X 

Inhibition  of  B.  subtilis  by  HO-^_^COOR 

log  1/C  -  0.55  log  PM. 61  (22 

n  -  26,  r  -  0.920,  s  -  0.220 
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D 

Inhibition  of  B.  subtilis  by  X-C6H40CC(CH3)-CH2 

log  1/C  -  0.62  log  P+  1.53  (23 

n  -  10.  r  -  0.976,  s  -  0.214 

Inhibition  of  S.  typhosa  by  X-2-naphthols 

log  1/C  -  0.63  log  P+  1.32  (24 

n  -  22.  r  -  0.898 

Inhibition  of  hydroxyindoie  O-methytransferase  by  N-acyltrypta mines 

log  1/C  -  0.60  log  P^  1.49  (25 

n  -  21.  r  -  0.948.  s  -  0.170 

Inhibition  of  rat  mitochondral  respiration  by  aromatic  amines  and  phenols 

log  1/C  -  0.60  log  P  +  1.52  (26 

n  -  19.  r  -  0.925,  s  -  0.289 

Division  of  sea  urchin  egg  by  ROCONH2 

log  1/C  -  0.65  log  P  +  1.33  (27 

n  -  10.  r  -  0.975.  s  -  0.147 
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} 


Equations  21  -  21  show  that  this  is  a  typical  "nonspecific”  kind  of 
activity  shown  by  a  variety  of  chemicals  in  a. variety  of  systems. 

Lateral  Correlations  in  Terms  of  Electronic  Effects 

The  following  are  a  few  examples  which  show  how  electronic 
properties  of  bioactive  compounds  can  be  related  back  to  a  simple  organic 
reaction. 

Inhibition  (50%)  of  E.  coli  by  X-C6H4N-C-S 

log  1/C  -  2.27  o  +  4.31  ^28 

n-  9.  r  -  0.963,  s  -  0.161 

Equation  28  is  unusual  in  that  no  hydrophobic  effect  is  found.  The 
aromatic  isothiocyantes  are  very  reactive  compounds  and  may  inhibit 
growth  by  reaction  with  nucleophiles  near  the  membrane  surface. 

QSAR  28  can  be  compared  with  29  and  30  for  reaction  with  two 
types  of  nucleophiles. 

S 

X-CeH^N-C-S  +  CH3CH2OH  - ►  X-C6H4NHCOC2H5 

log  k  -  2.16  o  -  4.80  (29 

n  -  8,  r  •>  0.975.  s  -  0.181 
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X-C6H4N-C-S  +  C6H5NH2 


log  k  -  2.14  a  -  3.13  (30 

n  -  4.  r  -  0.994,  s  -  0.060 

The  value  of  p  for  the  three  OSAR  are  in  very  good  agreement  suggesting  a 
mechanism  for  the  tosic  action  of  the  isothiocyanates. 

0 

ii 

LD50  of  house  flies  by  X-C6H40P( 002115)2 

log  1/C  -  2.44  o- +  0.26  :f  -  0.61  (31 

n-  8.  r-  0.988,  s-  0.219 

0 

Inhibition  of  fly  head  acetylcholinesterase  by  X-C6H40P(0C2H3)2 

log  1/C  -  2.45  0-  -  0.56  Es3  +  4.82  (32 

n  -  13,  r  -  0.962,  s  -  0.408 

In  the  case  of  eq.  3 1  only  one  meta  substituted  congener  was 
examined.  Several  were  studied  for  eq.  32  and  it  was  found  that  for 
enzyme  inhibition  a  small  steric  effect  is  present  for  meta  substituents 
which  is  accounted  for  by  Es3.  Again  we  see  good  agreement  between  the 
two  values  of  p,  and  o'  is  Hammett  constant  of  choice  as  one  might  expect. 
A  small  hydrophobic  effect  is  seen  in  the  whole  fly. 
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Sulfonamides  inhibiting  carbonic  anhydrase  yield  QSAR  33: 


log  1/ki  -  0.95  a  *  0.54  tt  -  0.35  B5.3  +  6.29  (33 

n  «  31,  r  »  .968,  s  ■>  .168 


The  sterimol  parameter  B3  applies  only  to  substituents  in  the  meta  (3) 
position.  It  is  thought  that  the  ionized  form  of  the  sulfonamide  binds  to  a 
positively  charged  zinc  in  the  enzyme. 

OSAR  34  shows  the  effect  of  substituents  on  the  ionization  of  the 
sulfonamides. 

A  log  K  -  0.86  0  +  0.08  (34 

n  “  16,  r  ••  0.962,  s  *  0.146 

After  corrections  for  the  hydrophobic  and  steric  effects  of  substituents  in 
QSAR  33  the  p  value  is  in  good  agreement  with  that  of  eq.  34  indicating 
that  inhibitory  effects  follow  propensity  for  ionization. 

The  data  in  Table  I  represents  a  different  approach  to  enzymic  QSAR. 
Instead  of  testing  many  compounds  on  a  single  enzyme  two  substrates 
have  been  tested  with  a  variety  of  enzymes.  The  value  of  p  comes  from 
equations  of  the  type;  log  l/Km  -  po  +  lur  +  aMR  +b  where  Km  is  the 
Michaelis  constant.  Since  the  structure  of  the  active  site  of  each  enzyme  is 
different,  terms  in  it  or  MR  or  both  may  or  may  not  appear  in  the  final 
QSAR,  nevertheless,  we  see  that  there  is  little  variation  in  p.  The 
mechanism  of  action  by  the  various  enzymes  in  forming  the  ES  complex  is 
much  the  same  for  the  two  different  substrates  in  terms  of  the  electronic 
effects  of  X. 
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One  example  from  the  physical-organic  database  will  now  be 
considered.  Over  the  years  there  has  been  discussion  about  the  value  of  p 
in  the  Hammett  equation.  That  is.  can  it  be  expected  that  for  similar 
reactions  under  similar  conditions  similar  values  of  p  are  found?  No 
systematic  study  has  been  made  of  the  problem  and  few  people  have 
much  interest  beyond  their  own  narrow  compartment.  However,  in 
building  a  generalized  approach  to  QSAR  one  wants  to  wring  as  much 
information  as  possible  from  each  QSAR. 

Searching  the  physical  data  base  under  the  command  2  P2  we  obtain 
307  hits  for  QSAR  on  hydrolysis.  Then  searching  on  2  <  "  S  “  <  2.6  we 
obtain  hits  of  which  eq.  35  to  43  are  relevant. 

0 

Alkaline  aqeous  hydrolysis  X-C6H40(1-NHCH3  at  37* 

log  k  -  2.48  a  +  3.03  (35 

n  =«  6,  r  «  0.977,  s  -  0.156 

Alkaline  hydrolysis  in  70%  dioxane  X-C6H4COOC2H5  at  25’ 

log  k  -  2.26  a  -  2.47  (36 

n  -  4,  r  -  0.989,  s  -  .068 


Alkaline  hydrolysis  X-C6H4COOC2H5  in  87%  ethanol  at  30" 


(57 


log  k  »  2.51  a  ~  1.28 
n  -  18,  r  -  0.993,  s  -  0.105 
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•Alkaline  hydrolysis  of  X-C6H4-CN  in  60%  ethanol  at  82 


log  k  -  2.13  a  -  1.0 
n  -  5.  r  -  0.981,  s  -  0.184 


Alkaline  hydrolysis 


COOCH3 


in  85%  methanol  at  25* 


log  k  -  2.03  0  -  2.09 
n  -  10,  r  -  0.997,  s  -  0.083 


Alkaline  hydrolysis  of  X-C6H4COOCH3  in  85%  methanol  at  25* 

log  k  -  2.25  0  -  3.72 
n  -  10,  r  -  0.996,  s  -  0.076 


Aqueous  hydrolysis  25*  of  X-C6H40C0CH2CH2C00“ 

log  k  -  2.49  0  -  2.55 
n  -  6,  r  -  0..^69,  s  -  0.214 


Alkaline  hydrolysis 


COOC2H5 

in  85%  ethanol  at  50* 


(38 


(39 


(40 


(41 


log  k  -  2.13  o  -  2.58 
n  -  9,  r  -  0.998,  s  -  0.048 
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(42 


Alkaline  hydrolysis  X-C6H4SO2OC6H5  in  30%  dioxane  at  50 


log  k  -  2.30  0  -  3.47  (43 

n  -  6,  r  -  0.998,  s  -  0.077 

There  is  considerable  variation  in  the  basic  structures  and  reaction 
conditions  from  which  eq.  35  -  43  have  been  derived,  yet  the  electronic 
effects  of  substituents  show  a  strong  central  tendency  toward  a  p  of  2.2. 
This  rough  agreement  can  be  helpful  in  bringing  more  general  order  to 
SAR  research.  It  is  of  interest  that  both  carbamates  (eq.  35)  and  nitriles 
(eq.  38)  behave  in  a  similar  fashion  in  terms  of  p.  The  sulfonates  (eq.  43) 
have  a  p  close  to  that  of  the  carboxylates. 

These  results  do  not  mean  that  all  hydrolysis  reactions  of  esters  of 
amides  will  always  have  a  p  of  about  2.2.  For  example,  making  the  same 
kind  of  search  in  the  range  of  1.5  -  2.1  for  p  yields  the  following  relevant 
hits. 

Hydrolysis  of  X-CeH^CCXZl  in  95%  acetone  at  25' 

log  k  -  1.78  a  -  4.20  (44 

n  -  5.  r  -  0.992,  s  -  0.1 12 

Neutral  Hydrolysis  of  X-CsH^OCOH  in  10%  acetonitrile  at  25* 


log  k  -  1.59  o  +  0.49 
n  -  8,  r  -  0.988,  s  -  0.104 
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(45 


Hydrolysis  of  (X-CeH^COaO  in  75%  dioxane  at  25 ‘ 


log  k  -  1.60  0  -5.18  (46 

n  -  9,  r  -  0.986.  s  -  0.220 

Alkaline  hydrolysis  of  X-C6H4CCX)C6H5  in  60%  acetone  at  25* 

log  k  -  1.92  o  -  1.40  (47 

n  -  11.  r  -  0.983,  s  -  0.100 

Alkaline  hydrolysis  of  X-CsH^C^CCOOCaHj  in  88%  ethanol  at  20° 

log  k  -  1.91  o  -  0.15  (48 

n  -  4.  r  -  0.957.  s  -  0.231 

Hydrolysis  of  X-C6H4CX:0C1  in  65%  acetone  at  25” 

log  k  -  1.54  0  -  2.7 1  (49 

n  -  11,  r  -  0.988.  s  -  0.107 

Alkaline  hydrolysis  of  X-C6H4OCOCH3  in  56%  acetone  at  1° 

log  k  -  1.87  a  -  1.06  (50 

n  -  15.  r  -  0.974,  s  -  0.124 
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Equations  47,  48  and  50  are  just  slightly  outside  of  our  first  sweep. 
The  anhydrides  and  acid  chlorides  of  eq.  44,  46  and  49  are  so  reactive  that 
one  would  not  expect  substituent  effects  to  be  as  important  as  with  normal 
esters  and  amides.  It  is  interesting  to  see  the  hydrolysis  of  formate  esters 
(eq.  45)  under  neutral  conditions  has  lower  p  value. 

Even  though  there  is  not  enough  uniform  data  in  hand  to  evaluate 
just  how  similar  p  would  be  for  similar  reactions  this  assumption  is  clearly 
of  some  value  just  as  the  coefficients  with  log  P  are. 

Work  on  the  two  data  bases  was  started  about  1970  and  largely 
discontinued  about  1977.  In  the  last  two  years  we  have  resumed  the 
addition  of  more  data,  especially  to  the  Bio  database.  We  now  need  to 
check  all  of  the  data,  install  a  more  standardized  nomenclature  and  redo 
many  of  the  QSAR.  Only  a  few  of  the  older  QSAR  have  been  developed 
using  the  bilinear  model,  also  improved  values  of  log  P  are  now  available. 
In  a  number  of  instances  data  sets  which  are  too  small  in  terms  of  data 
points/variable  need  to  be  deleted.  Even  with  these  shortcomings  the 
present  bank  is  a  useful  tool  to  those  heavily  involved  in  structure -activity 
analysis.  However,  if  we  can  enter  essentially  all  of  the  published  QSAR 
and  keep  up  with  those  currently  being  published  we  shall  have  an 
indispensable  tool  for  all  areas  of  structure  activity  analysis.  This  would 
entail  building  a  bank  about  two  to  three  times  that  of  our  current  two 
banks  which  have  in  all  about  5500  data  sets. 

Acknowledgements:  We  are  grateful  to  R.J.  Reynolds.  Burroughs 
Welcome,  Abbott  Laboratories  and  du  Pont  for  the  support  of  this  project. 
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Table  I 


Values  for  p  in  the  correlation  of  hydrolysis  of 
X-C6H4OCOCH2NHCOC6H5  (I)  and  X-C6H4OCOCH2NHSO2CH3  (II) 


Enzyme 

Substrate 

P 

pH 

Class  of  Hydrolase 

Papain 

I 

0.57 

6 

Cysteine 

Papain 

II 

0.55 

6 

Cysteine 

Ficin 

I 

0.57 

6 

Cysteine 

Ficin 

II 

0.62 

6 

Cysteine 

Actinidin 

I 

0.74 

6 

Cysteine 

Bromelain  B 

I 

0.70 

6 

Cysieine 

Bromelain  B 

II 

0.68 

6 

Cysteine 

Bromelain  D 

I 

0.63 

6 

Cysteine 

Subtilisin 

I 

0.49 

7 

Serine 

Chymotrypsin 

I 

0.42 

6.9 

Serine 

Trypsin 

I 

SUZL 

7 

Serine 

Mean 

-0.61 
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Table  II 


Browse  Menu 


1.  System 

2.  Class 

3.  Compound 

4.  Action 

5.  Reference 

6.  Souce 


7.  Check 

8.  Note 

10.  Parameters 

11.  Substituents 

12.  Smiles 
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Table  III 


Biological  Classifications 


BO 

Uknown 

B4 

Single-Celled  Organisms 

B4A 

Algae 

B1 

Nonenzymatic 

B4B 

Bacteria 

Macromolecules 

B4C 

Cells  in  culture 

DNA.  Fibrin. 

B4E 

Erythrocytes 

Hemogiobin,  Albumin 

B4F 

Fungi,  Molds 

B4P 

Protozoa 

B2 

Enzymes 

B4V 

Viruses 

B2A 

Oxidorsductases 

B4Y 

Yeasts 

B2B 

Transferases 

B2C 

Hydrolases 

B5 

Organs/Tissues 

B2D 

Lyases 

B5G 

Gastro-intestinal  tract 

B2E 

Isomerases 

B5H 

Heart 

B2F 

Ligase,  Tiiiokinase 

B5I 

laternal/scft  organs 

B2G 

Receptors 

B5N 

Nerves,  Brain,  Muscles 

B5S 

Skin 

B3 

Organelles 

B3A 

Mitochondria 

B6 

Multi-ceiluiar  organisms 

B3B 

Microsome 

B6A 

Animal  (vertebrates) 

B3C 

Chloroplasts 

B6B 

Insect  (bugs) 

B3M 

Membranes 

B6C 

Cancer 

B3S 

Synaptosomes 

B6H 

Human 

B6I 

Invertebrates 

B6P 

Plant 
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Table  IV 


Physical  Classifications 


PO 

Unknown 

P7 

Addition 

P7D 

Dimerization 

PI 

Ionization 

P7E 

Electrophilic  addition 

PIP 

Ionization  potential 

P7N 

Nucleophilic  addition 

PIX 

Proton  exchange 

P7P 

Polymerization 

P2 

Hydrolysis 

P8 

Elimination 

P9 

Rearrangement 

P3 

Solvolysis 

PIO 

Oxidation 

P12 

Radical  reactions 

P4 

Spectra 

F13 

Complex  formation 

P4I 

Ionization  spectra 

P4E 

ESR  spectra 

P14 

Partitioning 

P4M 

Mass  spectra 

PMC 

Chromatography 

P4N 

NMR  spectra 

P4R 

IR  spectra 

P15 

Pyrolysis 

P4U 

UV  spectra 

P16 

H-bonding 

P17 

Electrochemical 

P5R 

Miscellaneous  reactions 

P18 

Bronsted 

P19 

Esterification 

P6 

Substitution 

P20 

Photochemical 

P6E 

Electrophilic  substit. 

P21 

Hydrogenation 

P6N 

Nucleophilic  substit. 

P22 

Isokinetic 
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Table  V 


Search  Menu 


1.  System 

2.  Class 

3.  Compound 

4.  Action 

5.  Reference 

6.  Source 

7.  Check 

8.  Note 

9.  Date 

10.  Parameters 


11.  Substituents 

12.  SMILES 

14.  Prm  max/ min 

15.  Terms  in  eqn. 

16.  Coefs.  in  eqn. 

17.  I  deal/ log  B 

18.  Statistics 

19.  Residuals 

20.  Predicted 
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Table  VI 


Show  Menu 


0.  ID  numbers  only 

1.  System 

2.  Class 

3.  Compound 
4-  Action 

5.  Reference 

6.  Source 

7.  Check 

8.  Note 

9.  Date 

10.  Parameters 


11.  Substituents 

12.  SMILES 

13.  Parameter  value 

14.  Prm  max/ min 

15.  Terms  in  eqn. 

16.  Coefs.  in  eqn. 

17.  Ideal/log  B 

18.  Statistics 

19.  Residuals 

20.  Predicted 


Recent  Progress  and  Problems  in  Calculating  Log  Poet 

by  Albert  J.  Leo 

Pomona  College  Medicinal  Chemistry  Project 


The  early  algorithms  used  to  calculate  log  Poet  from  structural  ‘fragments’ 
yielded  values  for  very  lipophilic  solutes  (>6.0)  which  were  often  much 
higher  than  those  measured.  For  example,  DDT  was  calculated  as  6.91,  but 
in  1973  Metcalf  and  Lu,  using  a  radiolabeled  solute,  measured  it  at  3.98. 
Seeing  no  reason  for  the  calculations  to  be  so  greatly  in  error,  we  resisted 
the  temptation  to  insert  in  the  algorithm  some  ‘tailing-off  factor’  if  more 
than  a  certain  number  of  hydrophobic  fragments  were  added  together. 
Waiting  for  the  analytical  methods  to  improve  paid  off,  for,  as  Fig.  1  shows, 
the  most  reliable  technique  (slow-stir)  matches  that  from  the  CLOGP 
algorithm  for  DDT  (6.91)  and  is  very  close  for  DDE  (6.96  vs.  6,94). 


Tetrahydrocannabinol  (A-9-THC)  is  another  solute  that  seemed  to  require 
an  algorithm  change  to  bring  calculated  value  into  agreement  with 
measurment.  Early  measurements  put  its  Log  Poet  slightly  below  4.0, 
while  it  is  calculated  by  CLOGP  as  7.24.  Martin  and  co-workers  found  they 
could  reliably  measure  the  more  hydrophilic  analogs  of  THC  by  normal 
shake-flask  technique,  but  they  had  great  difficulty  with  those  as 
hydrophobic  as  the  parent.  Using  reversed  phase  HPLC  with  a  C-8  column, 
they  used  known  standards  to  establish  a  relationship  between  K’  and  log 
Poet-  As  seen  in  Fig  2,  they  found  a  good  relationship  between  CLOGP  and 
Log  Phplc  values  with  hydrophobic  analogs  up  to  log  11.5.  It  appears, 
then,  that  as  a  model  for  this  type  of  hydrophobic  behavior  (partial  de¬ 
solvation  as  on  a  hydrocarbon-coated  HPLC  support),  CLOGP  values  can  be 
used  to  an  upper  limit  of  at  least  12.0. 


The  more  recent  versions  of  CLOGP  contain  an  ortho  correction  (-0.12)  for 
adjacent  chlorines  on  an  aromatic  ring.  As  seen  in  Figs.  3  and  3a.  this 

becomes  important  with  multiple  substitution.  For  polychlorobiphenyls 

(Figs.  4.  &  4a.)  a  further  negative  correction  factor  (-0.25)  is  required  for 
chlorines  in  the  ortho  position  to  the  biphenyl  bond.  These  empirically- 

derived  correction  factors  agree  quite  well  with  the  lower-than-normal 
solvent  accessible  surface  area  (SASA)  as  calculated  by  the  SAVOL 
program^.  However,  it  is  also  known  that  in  hexachlorobenzene  the 


139 


chlorinr  atoms  are  forced  slightly  out  of  plane^,  and  the  negative 
correction  may  in  part  arise  from  less  delocalization  of  the  C-Cl  bonds.  In 
any  case,  the  empirical  corrections  allow  CLOGP  to  calculate  20  PCBs  with 
an  average  deviation  of  only  0.07. 


In  Fig.  4a  it  can  be  seen  that  for  the  last  three  analogs  (Cpds.  21,  22,  &  23) 
CLOGP  overpredicts  by  successively  greater  amounts.  It  seems  very  likey 
(as  proposed  by  DeBruijn  &  Hermans  who  provided  the  slow-stir  data^) 
that  the  octanol  present  in  the  saturated  water  phase  is  acting  as  a 
“surfactant”  which  solubilizes  very  lipophilic  solutes  by  some  pre-micellar 
mechanism.  If  the  behavior  being  modelled  may  parallel  this  action,  then 
CLOGP  values  above  log  7.5  should  be  reduced  in  a  manner  that  places  an 
asymtotic  upper  level  of  about  9.0. 


Halogens  attached  to  the  same  or  adjacent  aliphatic  carbon  atoms  are  more 
lipophilic  than  those  isolated  from  each  other.  CLOGP  uses  a  topological 
measure  of  this  separation,  and  it  has  been  known  for  some  time  that 
information  of  actual  spatial  separation,  as  obtained  from  a  molecular 
mechanics  program  such  as  CONCORD^  would  be  useful.  The  empirical 
correction  factor  devised  for  CLOGP  works  well  for  many  multi- 
halogenated  aliphatics,  such  as  the  four  hexachlorocyclohexanes  as  seen  in 
Fig.  5.  However,  in  aldrin  two  of  the  chlorines  crowded  on  the  bridged 
rings  have  a  topological  separation  of  three  carbons,  but  they  are  actually 
as  close  as  those  separated  by  only  two.  If  CLOGP  was  using  data  based  on 
spacial  distance  (instead  of  Cl-C-C-Cl),  it  would  yield  a  value  of  6.0  which  is 
better  but  still  considerably  low.  Note  that  crowding  of  chlorines  on  an 
aliphatic  framework  reduces  the  effective  SASA  for  each  atom,  but  the 
effect  is  to  make  the  solute  more  hydrophobic  than  expected,  whic!  is 
opposite  from  that  of  the  halo-aromatics. 


The  negative  correction  employed  by  CLOGP  for  the  branching  of  alkyl 
chains  was  always  thought  to  be  based  on  a  sound  theoretical  basis: 
namely,  that  branching  reduced  solute  size.  Using  four  types  of  size 
measurements  as  provided  by  SAVOL  (“bare”  area  or  volume;  or  the  area 
or  volume  as  covered  with  a  1.5  A°  layer).  Fig.  6  shows  that  the 
correlations  with  the  normal  alkanes  is  almost  perfect  (r^  >  0.995).  Adding 
branched  and  cyclic  alkanes  leaves  only  cyclopropane  as  an  outlier,  which 
is  not  unexpected  in  view  of  the  polarity  of  the  strained  carbon  bonds. 
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This  work  impelled  us  to  remeasure  2,3-dimethylbutane,  as  it  appeared  as 
a  mild  outlier.  Using  this  new  more  reliable  value,  it  appears  that  Eq.  C4a 
is  preferred  to  relate  measured  log  P  of  alkanes  to  their  size,  namely  the 
surface  area  with  a  1.5  A®  ‘sweater’. 


The  attempt  to  relate  ‘polar  group  branching’  to  solute  size  runs  into 
difficulty,  as  seen  in  Fig.  7  with  the  four  isomeric  butanols.  After  overall 
size  is  accounted  for  by  using  Eq.  C2  from  Fig.  6,  then  most  of  the  reduction 
of  0.53  in  log  P  (0.88  to  0.35)  still  must  come  from  the  changed  polarity  of 
the  hydroxyl  group;  i.e,  it  must  account  for  a  lowering  of  -2.36  to  -2.76  in 
log  P.  An  increase  in  hydrogen  bond  basicity,  p,  could  account  for  this,  but 
the  data  now  available  show  only  a  slight  change  (0.45  to  0.49)  in  this 
parameter.  Hydrogen  bond  acidity,  a,  is  not  thought  to  change  for  these 
isomers.  Accounting  for  just  the  size  difference  of  the  hydrocarbon  portion 
of  these  butanols  leaves  even  a  larger  apparent  increase  in  hydroxyl 
polarity  to  be  explained. 


The  early  data  for  polar  group  branching  on  alkanes  was  provided  largely 
by  the  alcohols  and  amines.  As  more  solute  types  were  measured  to 
determine  the  relationship  between  log  P  and  normal  vs  branched  SASA,  it 
was  discovered  that  the  negative  branching  correction  for  nitriles, 
carboxylic  acids,  and  di-branchcd  ketones  was  either  very  small  or 
completely  negligible.  See  Fig.  8.  Reasons  for  this  behavior  are  still  being 
sought. 
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DDT  &  ANALOGS 


DDT 

CLOGP  6.91 

MEAS.  3.98(Lu  &  Metcalf;73)  Radiolabel 
5.76(Fujita)  Std.  Shake-Flask 
6.36(Karickhoff,'79)  Re-extraction 
6.60(Briggs) 

6.91(de  Bruijn  &  Hermens)  Slow-Stir 


DDE 

a.OGP  6.94 
MEAS.  6.96 


FIGUHS  1 
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omim/nmu-mtKxmt 

TW  40t'WUt,  e»  rttUMVCOMST  US  KxPISIMSNTU  TMCSAmiK* 


v«i  as.  N«.  t 
U.S.4. 


Characterization  of  the  Lipophilicity  of  Natural  and  Synthetic 
Analogs  of  eyahvdrocannabinol  and  Its  Relationship  to 
Pharmacological  'Potency ' 


BRIAN  F.  THOMAS.  DAVID  R.  COMPTON  and  BIUY  R.  MARTIN 

Dtfiaitmtnt  e!  Ptmrmeotogf  and  Toxicology,  Madieal  Codaga  el  VkgMayifginia  Commomraadn  Unvamty.  Richmond,  Virginia 
AoHp<adtarpuMo«tionAu0uat7. 1990 


<  «  »  If  12 

log  Po/W  (IIPLC) 


Rg.  Relationship  between  experimentally  determined  log  Po/w  values 
oy  HPLC  and  computer-calculated  log  Po/w  values  for  cannabinokJ 
analogs.  Correlation  coefflcient «  0.93. 

FIGURE  2 
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Proximity  Corrections:  Chlorobenzenes 


Cl 

4.  1,2,3,4-Tetra  4.63  4.64'  0.01 

F  =  -0.36  4.55^ 


5.  1,2,4,5-Tetra  4.75  4.60'  -0.15 

F  =  -0.24  4.70'‘ 

(l)DeBruijn,  slow-stir.  (2)  Shake-flask,  Pom.MedChem. 
(3)Generator  Column.  (4)  Chiou,  G. 


FIGDSE  3-1 
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Proximity  Corrections:  Chlorobenzenes  (cont) 


Cl  CLOGP  Meas.  Diff. 


Cl 


(1)  DeBruijn,  slow-stir.  (2)  Shake-Flask,  PomMedChem. 
(3)  Generator  Column.  (4)  Chiou,  G.  (5)  Platford,  octanol 
as  bulk  phase  vs.  surface  film 


PIGOKE  3-2 


Chlorobiphenyls:  Proximity  &  Ortho  Corrections 

Prox. 


Ortho  =  -0.25  each 
Prox.  =  -0.12  each 


Substituents 

Meas. 

Calc. 

Ortho 

Prox. 

1.  2.C1 

4.53 

4.49 

+0,04 

-0.25 

0.00 

2.  2,4-Cl2 

5.16 

5.21 

-0.05 

-0.25 

0.00 

3.  4-Cl 

4.61 

4.74 

o 

1 

0.00 

0.00 

4.  3,4-Cl2 

5.29 

5.34 

-0.05 

0.00 

-0.12 

5.  2,6-Cl2 

4.98 

4.96 

+0.02 

-0.50 

0.00 

6.  2,3,4-Cl3 

5.86 

5.68 

+0.16 

-0.25 

-0.24 

7.  2,4,6.Cl3 

5.71 

5.67 

+0.04 

-0.50 

0.00 

8.  2,4,5-Cl3 

5.90 

5.80 

+0.10 

-0.25 

-0.12 

9.  2,3,4,5-Cl4 

6.41(1 

6.27 

+0.14 

-0.25 

-0.36 

10.2,3,4,5,6-C1s 

6.74 

6.62 

+0.12 

-0.50 

-0.48 

11.2,2'-Cl2 

4.97 

4.96 

+0.01 

-0.50 

0.00 

12.  2,3,2’-Cl3 

5.31(2 

5.55 

-0.24 

-0.50 

-0.12 

13.  2,3,4,2’-Ci4 

6.11 

6.14 

-0.03 

-0.50 

-0.24 

(1)  Not  by  slow-stir;  6.18  by  generator  column. 

(2)  By  HPLC 

FICURE  4-1 
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Chlorobiphenyls:  CLOGP  Calc.  Continued. 


SttbsJtituents 

Mgas. 

Ortho 

Prox, 

14. 

2,3,2S3’-Cl4 

6.18 

6.14 

+0.04 

-0.50 

-0.24 

15. 

2,6,2’,6*-Cl4 

5.94 

5.88 

+0.06 

-1.00 

0.00 

16. 

2,4,2',4’-Cl4 

6.29(2  6.38 

-0.07 

-0.50 

0.00 

17. 

2,2’,4,5’-Cl4 

6.36 

6.38 

-0.02 

-0.50 

0.00 

QO 

BiS’‘2y396*'d5 

7.12 

7.07 

+0.05 

-1.00 

-0.24 

19. 

Bis-2,4,6-Cl6 

7.29 

7.31 

+0.02 

-1.00 

0.00 

20. 

Bis-2,3,4-Cl6 

7.32 

7.33 

-0.01 

-0.50 

-0.48 

sit 

21. 

Bis-3,4,5-Cl6 

7.41 

7.83 

-0.42 

0.00 

-0.48 

22, 

Bis2,3,5,6Cl8 

7.73  . 

8.25 

-0.52 

-1.00 

-0.48 

23.  Decachloro 

8.27 

9.20 

-0.93 

-1.00 

-0.96 

(2)  Not  by  slow-stir. 


FIGURE  4-2 


HEX  ACHT  DRQCYCLOHEXANES 


CLOGP  =  3.75 


MEASURED: 

1.  Alpha 

3.80 

2.  Beta 

3.78 

3.  Gamma 

3.72 

4.  Delta 

4.14 

ALDRIN 

MEAS.  6.50 

CLOGP  5.41 


AS  AS  A/CL(in  Aldrin)=  15.17 
ASASA/CL(in  MeCl)  =  17.39 

FIGCIE  5 
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Relating  SAVOL  to  Log  P 


Volumes  with  and  without  1.5 A®  surface; 
Surface  Areas  (ditto) 

HYDROCARBONS: 


Vi.^and  V® 
Al'*  and  A® 


A. 

Straight  Chain: 

Methane  to 

Hexane: 

1. 

Log  P  X 

0.033 

V®  +0.256; 

n«6;  r2s0.996;  s«0.G76 

2. 

Log  P  X 

0.010 

VL5. 0.497; 

n»6;  r2*0.998;  s=0.055 

3. 

Log  P  X 

0.026  A®  -0.043; 

n=6;  r2*0.996;  saO.071 

4. 

Log  P  8 

0.017 

A»S-1.452 

n=6;  r2=0.999;  S80.036 

B. 

Straight  & 

Branched  Chains: 

Above  +  i-Butane,  2,3-11 

Butane  &  Neopentane 

1. 

Log  P  8 

0.032 

V®  +0.286; 

ns9;  r2=0.988;  8=0.110 

2. 

Log  P  a 

0.010 

VL5. 0.537; 

n*9;  r2«0.992;  s*0.091 

3. 

Log  P  X 

0.026 

A®  -  0.057; 

n«9;  r2=0.986;  s=0.115 

4. 

Log  P  8 

0.018 

AL5. 1,583; 

n=9;  r2=0.987;  s=0.114 

C.  Inci.  Cyclic  Hydrocarbons:  Cy-propane  (1>72)  an  Outlier 

1.  Log  P  »  0.032  V® +0.299;  n=»ll;  r2«0.987;  s»0.101 

Pred.LogP  L3TJ  »  2.02;  dev.=  -0.305 

2.  Log  P  a  0.010  V»-5. 0.538;  n»ll;  r2=0.992;  s=0.081 

2.04;  dev.s-0.317 

3.  Log  P  =  0.026  A®  -0.058;  nxU;  r2=0.985;  s=..109 

1.99;  dev.  =  -0.270 

4.  Log  P  a  0.018  AL5. 1.598;  n=sll;  r2=0.987;  s=0.103 

,  *  2.07;  dev.  =  -0.351 

LoO’P  «  0^(7  /,4^.  n  •(<;  •**  s  “ 

um  a®’ 

FIOUBE  6 
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Relating  SAVOL  to  Log  P 


Volumes  with  and  without  1.5Ao  surface:  V® 

Surface  Areas  (ditto)  A^**  and  A® 

Butanols 

A.  Using  £q.C*2  for  Total  Solute  Volume: 

Log  P  =  0.010  Vi  s  .  0.538 


§L 

1.Q4  (M=0.88) 

3.775-0.538 

r  3.24;  -2.36; 

.45 

2.Q1Y  (M=0.76) 

3.690-0.538 

=  3.15;  -2.39; 

.45 

3.QY2(M=:0.61) 

3.700-0.538 

s  3.16;  -2.55; 

.47 

4.QX  (M=0.35) 

3.650-0.538 

s  3.11;  -2.76; 

.49 

B.  Using  Eq.C-4  for  Area  of  Hydrocarbon  Only 
Log  P  =  0.018  Al  S  .  1.60 


I.Q4 

(MsO.SS) 

(210.2)0.18  -1.6  =  2.18; 

-1.30; 

.45 

2.Q1Y 

(MaO.76) 

(208.7)0.18  -1.6  =  2.16; 

-1.40; 

.45 

3.QY2 

(M=0.61) 

(218.8)0.18  -1.6  =  2.34; 

-1.73; 

.47 

4.QX 

(MsO.35) 

(212.1)0.18  -1.6  =  2.22; 

-1.87; 

.49 

FIGDKE  7 
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Relating  SAVOL  to  Log  P 


Volumes  with  and  without 

1.5A°  surface: 

V 1*®  and 

VO 

Surface  Areas  (ditto) 

A^*®  and 

A® 

Using  £q.  C-2  with  Total 

Solute  Volume: 

Pentanoic  Acids: 

Effect  C02H 

1.  QV4  (M=1.39) 

4.355  -0.538  =  3.82; 

-2.43 

2.  QV1Y(M=1.16) 

4.272  -0.538  =  3.73; 

-2.57 

3.  QVY2(M=1.18) 

4.292  -0.538  =  3.75; 

-2.57 

4.  QVX  (Mai. 47) 

4.196  -0.538  a  3.66; 

-2.19 

For  All:  a  =  0.54|  p  *  0.41 

Effect  aO;  p 

5.78;  -2.71  .48 

5.00;  -2.00  .46 


FIGOBK  8 


Nonanones: 

1.  4V4  (M=2.97)  6.315  -0.538  = 

2.  t-Bu  (Ma3.00)  5.540  -0.538  = 


151 


BLANK 


152 


APPENDIX  A 
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ORGANIZATIONS  OF  AUTHORS  IN  THESE  PROCEEDINGS 


Central  Michigan  University  49 

Institute  de  Quimica  Fisica  33 

La  Sierra  University,  Riverside  71 

Pomona  College  105,139 

Universite  de  NANTES  13 

University  of  California,  Irvine  9 

University  of  New  Orleans  77,85 

U.S.  Army  Chemical  Research,  Development  and 

Engineering  Center  7,65,71 
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APPENDIX  C 


Meeting  on  Solute/Solvent  Interactions 
Agenda 


Wednesday,  29  May  Seminar  Center 

0800-0815  Welcome/Introductioii 

Equilibrium 

0815-0845  R.  Taft 

08454)925  I.  Koppel 

0925-1000  M.  Berthelot 

1000-1030  J.  Abboud 

1030-1100  J.  Lorand 

Theoretical 

1100-1130  P.  Politzer 

1130-1200  G.  Famini 

1200-1230  L.  Wilson 

1400-1700  Demonstrations,  E3160 

Thursday,  30  May  Seminar  Center 

New  Approaches 

0900-930  C.  Hansch 

0930-1000  A.  Leo 

1000-1030  S.  Karickhoff 

1030-1100  L.  Carreira 

Wednesday  Afternoon 

Polymers 

1300-1350  M.  Abraham 

1350-1420  M.  Harris 

1420-1450  J.  Grate 

1450-1520  W,  Shuely 

1520-1550  A.  McGill 
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